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I 

INTEGRIN ALPHA SUBUNIT CYTOPLASMIC 
DOMAIN POLYPEPTIDES, ANTIBODIES AND METHODS 



Technical Field 

The present invention relates to polypeptides 
that define the integrin a 6 and a 3 subunits, 
particularly the cytoplasmic domain of the a 6 and a 3 
subunits. In addition, the invention describes 
antibodies immunoreactive with the cytoplasmic domain 
of a 6 and ac 3 , and methods for using the antibodies and 
polypeptides in assays for detecting a 6 and a 3 
subunits in body samples. 

Background 

The integrin family of cell surface receptors 
serve cellular adhesion functions. The receptors form 
a link between the extracellular matrix and the 
cytoskeleton through their binding to various 
extracellular components. Each integrin receptor is a 
heterodimer comprised of an a and a & subunit. At 
least 11 a chains (Ruoslahti and Giancotti, 1989) and 
six & chains (Sheppard et al., 1990) have been 
recognized in man. Each a subunit tends to associate 
with only one type of 5 subunit, but there are several 
exceptions to this rule (Hemler et al., 1989; Cheresh 
et al., 1989; Holzmann et al., 1989; Freed et al., 
1989) . 

The human heterodimer VLA-6 was identified using 
the monoclonal antibody GoH3 , which is immunoreactive 
with the a 6 subunit expressed on the surface of mouse 
and human cells. Hemler et al. J. Biol. Chem. . 
263:7660-7665, (1988); and Sonnenberg et al. J. Biol, 
Chem.. , 262:10376-10383, (1987). The amino terminal 
sequence of the human VLA-6 a 6 subunit was determined 
from purified protein (Kajiji et al. EMBO J , 8:673- 
680,1989) and was used to design degenerate 



oligonucleotides for probing a cDNA library. The full 
length sequence of a 6 cDNA, and its predicted amino 
acid sequence, were elucidated subsequent to cDNA 
cloning. Tamura, et al M J. Cell Biol. . 111:1593-1604 
(1990). While Tamura et al. , supra , also discloise 
multiple cDNA sequences encoding the VLA-6 B A subunit, 
there is provided no evidence that additional VIA-6 a 6 
subunits exist* European Patent Application 
Publication Number 279,669 (published July 24, 1988) 
describes human a 6 and B 4 subunits of an integrin 
receptor and the complex they associate to form on 
pancreatic and other cancer cells. The publication 
does not describe or suggest that an isoform of the a 6 
subunit exits. 

The full length sequence of a hamster cDNA 
encoding the Gap b3 cell surface membrane glycoprotein 
was described by Tsuji et al., J. Biol. Chem. . 
265:7016-7021 (1990). Based on the predicted amino 
acid sequence and predicted overall structure, it was 
suggested that Gap b3 is the hamster homolog of the a 3 
integrin subunit. The sequence of a cDNA encoding the 
partial sequence of chicken a 3 protein was disclosed 
in Hynes et al. J. Cell Biol. . 109:409-420 (1989). 
The cytoplasmic regions of these clones do not share 
homology with the cytoplasmic region of a 3B disclosed 
herein, and are therefore assumed to encode a 3A 
subunit isoform. Furthermore, neither publication 
suggest the possibility of an a 3B subunit. 

The N-terminal amino acid sequence of human a 2 
protein is provided in European Patent Application 
Publication Number 330,506 (published July 3, 1989). 
That publication provides no suggestion that an 
isoform of the a 3 protein, namely or 3B , exists. 

Brief Summary of the Invention 



A new species of alpha (a) integrin subunit 
protein has been discovered, with representative 
members in both the a 6 and a 3 class of integrins 
corresponding to the laminin receptor and the laminin, 
collagen and f ibronectin receptors, respectively. 
Specifically, it has been discovered that new a 6 
species and a 3 species exist which differ from 
previously described a 6 and a 3 proteins in the 
cytoplasmic domain of the protein. Through a 
combination of cDNA sequencing studies and anti- 
synthetic peptide antibody immunoreactivity studies, 
it has been shown that the cytoplasmic domain of these 
new proteins, designated a 6B and a 3B , are related 
between human and mouse isolates. 

Thus the present invention describes polypeptides 
comprising an amino acid residue sequence that 
includes the amino acid residue sequence defining an 
antigenic determinant in the cytoplasmic domain of the 
human or mouse a 6B or a 3B protein. Preferably, the 
polypeptide has a sequence corresponding to the whole 
cytoplasmic domain of either the human or mouse a 6B or 
a 3B protein. Alternatively, a polypeptide can 
correspond to all or substantially all of a native 
human or mouse a 6B or a 3B subunit in substantially 
isolated form. 

The polypeptides or proteins are useful as 
immunogens for preparing polyclonal and monoclonal 
antibodies immunoreactive with the human or mouse a 6B 
or a 3B cytoplasmic domains, and as reagents for use in 
diagnostic assays for detecting the a 6B or a 3B 
proteins. 

Thus, in a related embodiment the invention 
describes polyclonal and monoclonal antibodies having 
immunospecif icities for antigenic determinants on the 
cytoplasmic domains of a 6B and a 3B proteins. These 
antibodies find use in in vitro and in situ 
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immunoassays for detecting a 6B or a 3B cytoplasmic 
domain antigens in body samples such as tissues or 
fluids. 

Another aspect of the invention is the diagnostic 
5 methods and kits therefor, for detecting or 6B or or 3B j 

cytoplasmic domain antigenic determinants using an h 
antibody of this invention. 

Other features and benefits of the invention will 
become apparent from the following detailed 
10 description and specific examples describing the 

invention, its principles and preferred embodiments. 

Brief Description of the Drawings 

In the drawings forming a portion of this 
15 disclosure: 

Figure 1 illustrates immunoprecipitation of 
polypeptides from mouse cells using antibodies 
specific for the a 6 subunit. The differentiated 
(Diff.) ESI and D3 cells are described in Example 2. 
20 Antibody GoH3 is a monoclonal antibody immunospecif ic 

for the extracellular domain of the a 6A subunit. 
Antisera 6844 was raised in rabbit against a synthetic 
peptide specific for the cytoplasmic domain of human 
a 6A . The immunoprecipitated labeled proteins were 
25 visualized by SDS-PAGE. Molecular weight, in 

kilodaltons, is noted on the side of the gel. 

Figures 2 and 3 illustrate a sequential 
immunoprecipitation analysis of a 6 subunit s in human 
JAR cell lysates as described in Example 2. NRS is 
3 0 normal rabbit preimmune sera, anti-a 6 Mab is GoH3, 

anti-a 6A is sera 6844 and anti-a 6B is sera 382. The * 
molecular weight of standard protein markers is shown 
on the right side of the gel and is expressed in 
kilodaltons (KDa) . Figure 2 shows immunodepletion 
35 with NHS or with anit^a 6 Mab, and Figure 3 shows 

immunodepletion with anti-a 6A or with anti-a 6B . 



Figure 4 shows a 6A and a 6B PCR amplification 
products visualized on an ethidium stained gel. 
Single-stranded cDNA was generated from human PG, JAR 
and U9 3 7 cells and was amplified with a set of 
primers, 1156 and 1157, specific for the human a 6A 
sequence as described in Example 3. The primers were 
also used to amplify the cloned human a 6A cDNA 
sequence, which yielded an amplification product of 
about 540 bp. The amplification products from the 
tested cell lines were either 540 bp or 410 bp, or 
both . 

Figure 5 compares the nucleotide sequences of the 
54 0 bp and 410 bp amplification products described in 
Figure 4- The 54 0 bp product shown on the top line is 
designated a 6A , and the 410 bp product shown on the 
bottom line is designated a 6B . Vertical bars denote 
where the two sequences are homologous. Horizontal 
dots denote a 130 nucleotide (nt) deletion in the a 6B 
sequence with respect to the a 6A sequence. The 13 0 nt 
deletion is in the region that encodes the a 6A 
cytoplasmic domain . 

Figure 6 provides and compares the predicted 
amino acid sequence for the ce 6 amplification products 
shown in Figure 5- The solid arrows show the location 
of the outer PCR primers ; the broken arrows show the 
location of the nested inner PCR primers. The 
underlined sequence represent the putative 
transmembrane domain. The open boxed area is the a 6A 
cytoplasmic domain; the shaded boxed area is the a 6B 
cytoplasmic domain. The bracketed area represents the 
13 0 nt sequence deleted from the a 6B sequence. 

Figure 7 depicts an ethidium bromide-stained gel 
of the PCR amplification products. generated from (A) 
undifferentiated ESI and B16 cells and (B) 
undifferentiated and differentiated ESI cells as 
described in Example 3. The same priming 
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oligonucleotides were used to amplify cDNA from these 
cells. 

_ Figure 8 provides and compares the nucleotide and 
predicted amino acid sequences for the mouse a 6 - 
amplification products shown in Figure 7. The a 6B j 
sequence is on the top line; the a 6A sequence is on * 
the bottom line. Predicted amino acid residues are 
noted below the nucleotide sequence. The solid arrows 
show the location of the PGR primers. The boxed 
regions encompass the start of cytoplasmic domain for 
the a 6A and a 6B proteins, respectively. 

Figure 9 illustrates the results of in situ 
immunostaining of diseased human kidney tissue. Panel 
A is stained with polyclonal antisera 6488 specific 
for the a 6A cytoplasmic region. Panel B is stained 
with polyclonal antisera 382 specific for the a 6B 
cytoplasmic region. 

Brief Description of the Sequences in the Sequence 
20 Listing 

The Sequence Listing is shown after the Examples 
and before the Claims. 

SEQ ID NO 1 is the 1073 residue amino acid 
sequence of the human a 6A protein. The putative 
25 transmembrane region is encompassed by amino acids 

1012-1037. The mature protein is cleaved from the 
signal sequence between amino acids 23-24. Potential 
sites of N-linked glycosylation are at positions 223, 
284, 370, 513, 731, 748, 891, 927 and 958. Putative 
30 cation binding domains are at positions 230-238 , 324- 

332, 386-394 and 441-449. The cytoplasmic sequence * 
GFFKR, which is conserved in virtually all of the 

integrin a chains, begins at amino acid position 1040. * 
The sequence encoded by the fragment of cDNA 
35 amplified using primers 1156/1157 is encompassed by 

residues 927-1073. 



10 



15 



SEQ ID NO 2 is the 5629 base nucleotide sequence 
of the human a 6A cDNA. The initiating ATG is at 
nucleotide position 147. The mature coding sequence 
begins at nucleotide position 216 and ends at position 
3365. The cytoplasmic sequence GFFKR is encoded by 
nucleotides 3264-3278. The 130 nucleotide sequence 
present in SEQ ID NO 2 but deleted from SEQ ID NO 4 is 
encompassed by nucleotides 3261-3390. The sequence of 
the a 6A cDNA amplified using primers 1156/1157 is 
encompassed by nucleotides 2924-3455. 

SEQ ID NO 3 is the 1091 residue amino acid 
sequence of the human a 6B protein. The sequence of 
SEQ ID NO 3 is identical to SEQ ID NO 1 between amino 
acids 1 and 1044. The sequence encoded by the 
fragment of a 6B cDNA amplified using primers 1156/1157 
is encompassed by residue 927 through 1060. 

SEQ ID NO 4 is the 5499 base nucleotide sequence 
of the human a 6B cDNA. The sequence of SEQ ID NO 4 is 
identical to SEQ ID NO 2 between nucleotides 1 and 
3260. Nucleotides 3261-5499 of SEQ ID NO 4 are 
identical to nucleotides 3391-5629 of SEQ ID NO 2 . 
SEQ ID NO 4 has a 13 0 nucleotide deletion in relation 
to SEQ ID NO 2. The sequence of the a 6B cDNA 
amplified using primers 1156/1157 is encompassed by 
nucleotides 2924-3325. 

SEQ ID NO 5 is the 141 amino acid sequence 
predicted from the nucleic acid product which results 
from amplification of the mouse a 6B cDNA with primers 
1157/1156. The putative transmembrane domain begins 
at amino acid 88 and continues through amino acid 113. 
SEQ ID NO 5 is identical to SEQ ID NO 7 at amino acid 
position 1 through 120; the two sequences diverge at 
amino acid 121. 

SEQ ID NO 6 is the .426 base nucleotide sequence 
corresponding to the mouse a 6B amino acid sequence in 
SEQ ID NO 5. The putative transmembrane region is 
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encoded by nucleotides 262 through 337. SEQ ID NO 6 
is identical to SEQ ID NO 8 except for 13 0 nucleotides 
present in SEQ ID NO 8 but deleted between nucleotides 
342 and 343 of SEQ ID NO 6 . - 
5 SEQ ID NO 7 is the 14 9 amino acid sequence 

predicted from the product which results from * 
amplification of the mouse or 6A cDNA with primers 
1157/1156. SEQ ID NO 7 is identical to SEQ ID NO 5 at 
amino acid positions 1 through 120; the sequences 
10 diverge at amino acid 121. 

SEQ ID NO 8 is the 556 base nucleotide sequence 
corresponding to the mouse a 6A amino acid sequence in 
SEQ ID NO 7, plus the first 109 nucleotides in the 3 1 
noncoding region. SEQ ID NO 8 is identical to SEQ ID 
15 NO 6 except it has a 13 0 base insertion (nucleotides 

342-472 of SEQ ID NO 8) between nucleotides 3 52 and 
353 of SEQ ID NO 6. 

SEQ ID NO 9 is the 153 amino acid sequence 
predicted from the product which results from 
20 amplification of the mouse cr 3B cDNA with primers 

2032/2033. The cytoplasmic sequence CDFFK begins at 
amino acid position 108. 

SEQ ID NO 10 is the 463 base nucleotide sequence 
corresponding to the mouse a 3B amino acid sequence in 
25 SEQ ID NO 9. The cytoplasmic sequence CDFFK is 

encoded by nucleotides 324-338. 

SEQ ID NO 11 is the outer 5" PCR primer 1157, 
corresponding to bp 2918-2937 of the a 6A cDNA 
sequence of Sequence ID NO 2. 
30 SEQ ID NO 12 is the outer 3" PCR primer 1156, 

corresponding to the complement of bp 3454-3473 of the * 
a 6A cDNA sequence of SEQ ID NO 2 . 

SEQ ID NO 13 is the inner 5 1 nested PCR primer 
1681, corresponding to bp 2 94 2-29 60 of the a 6A cDNA 
35 sequence of SEQ ID NO 2 . 
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SEQ ID NO 14 is the inner 3 1 nested PCR primer 
2002, corresponding to the complement of bp 3433-3452 
of the a 6A cDNA sequence of SEQ ID NO 2. 

SEQ ID NO 15 is the 5' PCR primer 2032, 
corresponding to the hamster a 3A cDNA sequence of 
Tsuji et al., J. Biol. Chem. r 265:7016-7021 (1990). 

SEQ ID NO 16 is the 3 f PCR primer 2033, 
corresponding to the hamster a 3A cDNA sequence of 
Tsuji et al., J . Biol. Chem. , 265 : 7016-7021 (1990) . 



Detaile d Description of the Invention 
A. Definitions 

Amino Acid Residue; An amino acid formed upon 
chemical digestion (hydrolysis) of a polypeptide at 
15 its peptide linkages. The amino acid residues 

described herein are preferred to be in the "L" 
isomeric form. However, residues in the "D" isomeric 
form can be substituted for any L-amino acid residue, 
as long as the desired functional property is retained 
2 0 by the polypeptide. NH 2 refers to the free amino 

group present at the amino terminus of a polypeptide. 
C00H refers to the free carboxy group present at the 
carboxy terminus of a polypeptide. In keeping with 
standard polypeptide nomenclature described in 
25 Biol. Chem., 243:3552-59 (1969) and adopted at 37 

C.F.R. 1.822(b)(2)), abbreviations for amino acid 
residues are shown in the following Table of 
Correspondence: 

3 0 TABLE OF CORRESPONDENCE 

SYMBOL AMINO ACID 

1-Letter 3 -Letter 

Y Tyr tyrosine 

G G *y glycine 

35 F ph © phenylalanine 

M Met methionine 
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Ala 


alanine 




Ser 


serine 


I 


He 


isoleucine 


Ii 


Leu 


leucine 


m 


Txl2T 


threonine 


TT 
V 


V£L-L 


valine 


TD 

±r 


PjTO 


proline 


XT 


j_iys 


lysine 


TT 
XI 


XT 4 r- 


nistidme 


Q 


Gin 


glutamine 


E 


Glu 


glutamic Ucid 


W 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


C 


Cys 


cysteine 



It should be noted that all amino acid residue 
sequences are represented herein by formulae whose 
20 left and right orientation is in the conventional 

direction of amino-terminus to carboxy-terminus . In 
addition, the phrase "amino acid residue" is broadly 
defined to include the amino acids listed in the Table 
of Correspondence and modified and unusual amino 
25 acids, such as those listed in 37 C.F.R. 1.822(b)(4), 

and incorporated herein by reference. Furthermore, it 
should be noted that a dash at the beginning or end of 
an amino acid residue sequence indicates either a 
peptide bond to a further sequence of one or more 
3 0 amino acid residues or a covalent bond to a carboxyl 

or hydroxyl end gxoup. 

Polypeptide and Peptide : Polypeptide and peptide 
are terms used interchangeably herein to designate a 
linear series of amino acid residues connected one to 
35 the other by peptide bonds between the alpha-amino and 

carboxy groups of adjacent residues. 



Protein: Protein is a term used herein to 
designate a linear series of greater than about 50 
amino acid residues connected one to the other as in a 
polypeptide. 

Synthetic peptide : refers to a chemically 
produced chain of amino acid residues linked together 
by peptide bonds that is free of naturally occurring 
proteins and fragments thereof. 

Nucleotide ; A monomeric unit of DNA or RNA 
consisting of a sugar moiety (pentose) , a phosphate, 
and a nitrogenous heterocyclic base. The base is 
linked to the sugar moiety via the glycosidic carbon 
(l 1 carbon of the pentose) and that combination of 
base and sugar is a nucleoside. When the nucleoside 
contains a phosphate group bonded to the 3 1 or 5 1 
position of the pentose it is referred to as a 
nucleotide. A sequence of operatively linked 
nucleotides is typically referred to herein as a "base 
sequence" or "nucleotide sequence", and their 
grammatical equivalents, and is represented herein by 
a formula whose left to right orientation is in the 
conventional direction of 5 1 -terminus to 3' -terminus. 

Base Pair (bp): A partnership of adenine (A) 
with thymine (T) , or of cytosine (C) with guanine (G) 
in a double stranded DNA molecule. In RNA, uracil (U) 
is substituted for thymine. 

Nucleic Acid : A polymer of nucleotides, either 
single or double stranded. 

Polynucleotide : a polymer of single or double 
stranded nucleotides. As used herein "polynucleotide" 
and its grammatical equivalents will include the full 
range of nucleic acids. A polynucleotide will 
typically refer to a nucleic acid molecule comprised 
of a linear strand of two or more deoxyribonucleotides 
and/or ribonucleotides. The exact size will depend on 
many factors, which in turn depends on the ultimate 
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conditions of use, as is well known in the art. The 
polynucleotides of the present invention include 
primers , probes, RNA/DNA segments, oligonucleotides or 
"oligos" (relatively short polynucleotides), genes, 
5 vectors, plasmids, and the like. 

Gene ; A nucleic acid whose nucleotide sequence 
codes for an RNA or polypeptide. A gene can be either 

RNA or DNA. 

Duplex DNA ; a double-stranded nucleic acid 

10 molecule comprising two strands of substantially 

complementary polynucleotides held together by one or 
more hydrogen bonds between each of the complementary 
bases present in a base pair of the duplex. Because 
the nucleotides that form a base pair can be either a 

15 ribonucleotide base or a deoxyribonucleotide base, the 

phrase "duplex DNA" refers to either a DNA— DNA duplex 
comprising two DNA strands (ds DNA), or an RNA— DNA 
duplex comprising one DNA and one RNA strand. 

' Recombinant DNA ( :r DNA ) molecule ; a DNA molecule 

20 produced by operatively linking two DNA segments. 

Thus, a recombinant DNA molecule is a hybrid DNA 
molecule comprising at least two nucleotide sequences 
not normally found together in nature. rDNA's not 
having a common biological origin, i.e., 

25 evolutionarily different, are said to be 

"heterologous " . 

Vector ; a rDNA molecule capable of autonomous 
replication in a cell and to which a DNA segment, 
e.g., gene or polynucleotide, can be operatively 

30 linked so as to bring about replication of the 

attached segment. Vectors capable of directing the 
expression of genes encoding for one or more proteins 
are referred to herein as "expression vectors". 
Particularly important vectors allow cloning of cDNA 

35 (complementary DNA) from mRNAs produced using reverse 

transcriptase. 
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B. Integrin Alpha Subunit Polypeptides 
The present invention relates to a 
previously undescribed species of integrin alpha 
5 subunit that is derived by splicing of the messenger 

RNA in the tissue in which the integrin alpha subunit 
is expressed, such that the amino acid sequence of the 
alpha subunit polypeptide has a seguence as defined 
herein. 

10 Splicing as a form of regulation of gene 

expression is one means by which a cell regulates the 
structural gene products expressed in that cell type. 
According to the structures defined herein, it is now 
known that the a 6 and a 3 integrin subunits can each be 
15 expressed in two alternate forms (isoforms), 

designated herein as an "A" form and a »B" form 
depending upon the spliced product, and are referred 
to as a 6A or a 6B , and as a 3B or a 3B . 

The newly described a 6B and a 3B subunits contain a 
20 car boxy terminal amino acid residue seguence defining 

their cytoplasmic domain that is different from their 
a SA and a 3A counterparts. These new species of a 6B and 
a 3B provide, based on their seguence differences, 
novel polypeptide reagents based on (1) the antigenic 
25 determinants present in their cytoplasmic domains and 

(2) the structural role the cytoplasmic domain of 
these proteins play in the function of the integrins 
of which they are members . 

1. _or «, Subunit Polypeptides 
30 In one embodiment, the present invention 

contemplates a polypeptide based on the cytoplasmic 
domain of the a 6B species of the integrin a 6 subunit. 
This polypeptide has an amino acid seguence that 
includes a seguence that corresponds, and preferably 
35 is identical to, the amino acid residue seguence of 

the cytoplasmic domain of the human or mouse a 6B . 
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The cytoplasmic domain of human a 6B includes an 
amino acid residue sequence shown in SEQ ID NO 3 from 
residue 1068 to residue 1091 and of mouse a 6B has an 
amino acid residue sequence shown in SEQ ID NO 5 from 
5 residue 121 to residue 141. 

Thus, in one embodiment, the present invention 
contemplates a polypeptide having an amino acid 
residue sequence that includes at least the sequence 
shown in SEQ ID NO 3 from residue 1068 to residue 1091 

10 that defines the carboxy terminal portion of 

cytoplasmic domain of human a 6B . Preferably a 
polypeptide has an amino acid residue sequence shown 
in SEQ ID NO 3 from residue 1068 to residue 1091, and 
more preferably has an amino acid residue sequence 

15 shown in SEQ ID NO 3 from residue 104 5 to residue 

1091. In a related embodiment the invention 
contemplates the whole human a 6B protein, in a 
substantially isolated form, having a sequence shown 
in SEQ ID NO 3 from residue 1 to residue 1091. 

20 By substantially isolated is meant that the 

protein is present in a composition as a major 
constituent, typically in amount greater than 10%, and 
preferably greater than 9 0%, of the total protein in 
the composition. Human a 6B protein can be isolated by 

25 a variety of biochemical and immunological means from 

the tissue sources and cells described herein that 
contain a 6B subunit. Exemplary methods involve the 
use of a a 6B cytoplasmic domain specific antibody, 
such as 3 82 described herein, alone or in combination 

30 with the teachings of Kajiji et al. , EMBO J., 8:673- 

680 (1989) . 

In a related embodiment, the present invention 
contemplates a polypeptide having an amino acid 
residue sequence that includes at least the sequence 
35 shown in SEQ ID NO 5 from residue 121 to residue 141 

that defines a portion of the cytoplasmic domain of 



15 

mouse a 6B . Preferably a polypeptide has an amino acid 
residue sequence shown in SEQ ID NO 5 from residue 121 
to residue 141, Also contemplated is the whole mouse 
a 6B protein in a substantially isolated form that 
included a sequence shown in SEQ ID NO 5 from residue 
1 to residue 141, with the degree of isolation being 
the same as above for human a 6B . Purification 
of mouse or 6B can similarly be accomplished using the 
methods described above, and particularly using the 
murine cells described herein as a source of protein 
and an anti-peptide antibody prepared using mouse a 6B 
cytoplasmic domain-derived polypeptides. 

The native mouse a 6B subunit polypeptide is a 
protein of about 125,000 daltons in molecular weight 
when analyzed by PAFE-SDS under reducing conditions as 
described in the Examples, 

The native human q 6B subunit polypeptide is a 
protein of about 125,000 daltons in molecular weight 
when analyzed by polyacrylamide-sodium dodecyl sulfate 
gel electrophoresis (PAGE-SDS) under reducing 
conditions as described in the Examples. 

2 • _Q, » Subunit Polypeptides 
In another embodiment, the present invention 
contemplates a polypeptide based on the cytoplasmic 
domain of the a 3B species of the integrin a 3B subunit. 
This polypeptide has an amino acid sequence that 
includes a sequence that corresponds, and preferably 
is identical to, the amino acid residue sequence of 
the cytoplasmic domain of the human or mouse a 3B . 

The cytoplasmic domain of mouse a 3B has an amino 
acid residue sequence shown in SEQ ID NO 9 from 
residue 113 to residue 153. 

Thus, in one embodiment, the present invention 
contemplates a polypeptide having an amino acid 
residue sequence that includes at least the sequence 
shown in SEQ ID NO 9 from residue 113 to residue 153 
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that defines a portion of the cytoplasmic domain of 
a 3B . Preferably a polypeptide has an amino acid 
residue sequence shown in SEQ ID NO 9 from residue 113 
to residue 153 f and more preferably has an amino acid 
5 residue sequence shown in SEQ ID NO 9 from residue 1 

to residue 153. 

In a related embodiment, the invention 
contemplates the whole mouse a 3B protein, in a 
substantially isolated form having a sequence that 
10 includes the sequence shown in SEQ ID NO 9 from 

residue 1 to residue 153. The degree of isolation for 
mouse a 3B is the same as is for human a 6B above, with 
methods for preparing the mouse q 3B similarly based on 
immunoprecipitation or immunoaf f inity isolation 
15 methods using an antibody specific for mouse a 3B 

cytoplasmic domain as defined herein. 

In preferred embodiments, a polypeptide of the 
present invention comprises about 20 to 1100 amino 
acid residues, and preferably comprises about 24 to 50 
20 amino acid residues. 

Preferably, a polypeptide of this invention is 
further characterized by its ability to 
immunologically mimic an epitope (antigenic 
determinant) expressed by the cytoplasmic domain of 
25 a 6B or a 3B as defined herein. 

As used herein, the phrase "immunologically 
mimic" in its various grammatical forms refers to the 
ability of a polypeptide of this invention to 
immunoreact with an antibody of the present invention 
30 that recognizes an epitope on the cytoplasmic domain 

of a 6B or a 3B as defined herein. 

It should be understood that a subject 
polypeptide need not be identical to the amino acid 
residue sequence of or 6B or a 3B so long as it includes a 
35 sequence that provides at least one epitope within the 

cytoplasmic domain of the a 6B or a 3B subunit and is 
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able to inununoreact with antibodies of the present 
invention. 

A subject polypeptide includes any analog, 
fragment or chemical derivative of a polypeptide whose 
5 amino acid residue sequence is shown herein so long as 

the polypeptide is capable of immunologically 
mimicking a native epitope present in the cytoplasmic 
domain of a 6B or a 3B . Therefore, a polypeptide can be 
subject to various changes, substitutions, insertions, 
10 and deletions where such changes provide for certain 

advantages in its use. The term "analog" includes 
any polypeptide having an amino acid residue sequence 
substantially identical to a sequence specifically 
shown herein in which one or more residues have been 
15 conservatively substituted with a functionally similar 

residue and which displays the ability to mimic the 
cytoplasmic domain of a 6B or a 3B as described herein. 
Examples of conservative substitutions include the 
substitution of one non-polar (hydrophobic) residue 
20 such as isoleucine, valine, leucine or methionine for 

another, the substitution of one polar (hydrophilic) 
residue for another such as between arginine and 
lysine, between glutamine and asparagine, between 
glycine and serine, the substitution of one basic 
25 residue such as lysine, arginine or histidine for 

another, or the substitution of one acidic residue, 
such as aspartic acid or glutamic acid for another. 

The phrase "conservative substitution" also 
includes the use of a chemically derivatized residue 
30 in place of a non-derivatized residue provided that 

such polypeptide displays the requisite binding 
activity. 

"Chemical derivative" refers to a subject 
polypeptide having one or more residues chemically 
35 derivatized by reaction of a functional side group. 

Such derivatized molecules include for example, those 
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molecules in which free amino groups have been 
derivatized to form amine hydrochlorides, p-toluene 
sulfonyl groups , carbobenzoxy groups, t— 
butyloxycarbbnyl groups, chldroacetyl groups or formyl 
5 groups. Free carboxyl groups may be derivatized to 

form salts , methyl and ethyl esters or other types of 
esters or hydrazides. Free hydroxy 1 groups may be 
derivatized to form O-acyl or 0-al)cyl derivatives. 
The imidazole nitrogen of histidine may be derivatized 
10 to form tf-im-benzylhistidine. Also included as 

chemical derivatives are those peptides which contain 
one or more naturally occurring amino acid derivatives 
of the twenty standard amino acids. For examples: 4- 
hydroxyproline may be substituted for proline; 5- 
15 hydroxy lysine may be substituted for lysine; 3- 

methylhistidine may be substituted for histidine; 
homoserine may be substituted for serine; and 
ornithine may be substituted for lysine. Polypeptides 
of the present invention also include any polypeptide 
20 having one or more additions and/ox- deletions or 

residues relative to the sequence of a polypeptide 
whose sequence is shown herein, so long as the 
requisite activity is maintained. 

The term "fragment" refers to any subject 
25 polypeptide having an amino acid residue sequence 

shorter than that of a polypeptide whose amino acid 
residue sequence is shown herein. 

When a polypeptide of the present invention has a 
sequence that is not identical to the sequence of the 
3 0 cytoplasmic domain of or a 3B because one or more 

conservative or non-conservative substitutions have 
been made, usually no more than about 3 0 number 
percent, more usually no more than 20 number percent, 
and preferably no more than 10 number percent of the 
35 amino acid residues are substituted, except that 

additional residues can be added at either terminus 
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for the purpose of providing a "linker 11 by which the 
polypeptides of this invention can be conveniently 
affixed to a label or solid matrix, or carrier, such 
that the linker residues do not form epitopes 
expressed by the cytoplasmic domain of a 6B or a 3B as 
defined herein. Labels, solid matrices and carriers 
that can be used with the polypeptides of this 
invention are described hereinbelow. 

Amino acid residue linkers are usually at least 
one residue and can be 40 or more residues, more . of ten 
1 to 10 residues. Typical amino acid residues used 
for linking are tyrosine, cysteine, lysine, glutamic 
and aspartic acid, or the like. In addition, a 
subject polypeptide can differ, unless otherwise 
specified, from the natural sequence of an a 6B or a 3B 
cytoplasmic domain by the sequence being modified by 
terminal-NH 2 acylation, e.g., acetylation, or 
thioglycolic acid amidation, by terminal- 
carboxylamidation, e.g. , with ammonia, methylamine, 
and the like. 

When coupled to a carrier to form what is known 
in the art as a carrier-hapten conjugate, a 
polypeptide of the present invention is capable of 
inducing antibodies that immunoreact with the 
cytoplasmic domain of either human or mouse q6B or 
mouse a 3B . Where the immunogen is an a 3B -derived 
polypeptide, the induced antibodies immunoreact with 
the cytoplasmic domain of either human or mouse a 3B , 
This cross-reactivity between human and mouse 
cytoplasmic domains is shown by the disclosures 
herein. In view of the well established principle of 
immunologic cross-reactivity, the present invention 
therefore contemplates antigenically related variants 
of the polypeptides of this invention. An 
"antigenically related variant" is a subject 
polypeptide that is capable of inducing antibody 
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molecules that immunoreact with a subject polypeptide 
and with a 6B or a 3B . 

Any peptide of the present invention may be used 
in the form of a pharmaceutically acceptable salt. 
5 suitable acids which are capable of forming salts with 

the peptides of the present invention include 
inorganic acids such as hydrochloric acid, hydrobromic 
acid, perchloric acid, nitric acid, thiocyanic acid., 
sulfuric acid, phosphoric acetic acid, propionic acid, 
10 glycolic acid, lactic acid, pyruvic acid, oxalic acid, 

malonic acid, succinic acid, maleic acid, fumaric | 
acid, anthranilic acid, cinnamic acid, naphthalene 
sulfonic acid, sulfanilic acid or the like. 

Suitable bases capable of forming salts with the 
15 peptides of the present invention include inorganic 

bases such as sodium hydroxide, ammonium hydroxide, 
potassium hydroxide and the like; and organic bases 
such as mono-, di- and tri-alkyl and aryl amines (e.g. 
triethylamine, diisoplropyl amine, methyl amine, 
20 dimethyl amine and the like) and optionally 

substituted ethanolamines (e.g. ethanolamine, 
diethanolamine and the like) . 

A polypeptide of the present invention, also 
referred to herein as a subject polypeptide, can be 
25 synthesized by any of the techniques that are known to 

those skilled in the polypeptide art, including 
recombinant DNA techniques. Synthetic chemistry 
techniques, such as a solid-phase Merrif ield-type 
synthesis, are preferred for reasons of purity, 
3 0 antigenic specificity, freedom from undesired side 

products, ease of production and the like. An 
excellent summary of the many techniques available can 
i be found in J.M. Steward and J.D. Young, "Solid Phase 

Peptide Synthesis", W.H. Freeman Co., San Francisco, 
35 1969; M. Bodanszky, et al., "Peptide Synthesis", John 

Wiley & Sons, Second Edition, 1976 and J. Meienhofer, 
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"Hormonal Proteins and Peptides" , Vol. 2, p. 46, 
Academic Press (New York) , 1983 for solid phase 
peptide synthesis , and E . Schroder and K. Kubke, "The 
Peptides", Vol. 1, Academic Press (New York) , 1965 for 
5 . classical solution synthesis, each of which is 
incorporated herein by reference. Appropriate 
protective groups usable in such synthesis are 
described in the above texts and in J.F.W. McOmie, 
"Protective Groups in Organic Chemistry", Plenum 
10 Press, New York, 1973, which is incorporated herein by 

reference. 

In general, the solid-phase synthesis methods 
contemplated comprise the sequential addition of one 
or more amino acid residues or suitably protected 
15 amino acid residues to a growing peptide chain. 

Normally, either the amino or carboxyl group of the 
first amino acid residue is protected by a suitable, 
selectively removable protecting group. A different, 
selectively removable protecting group is utilized for 

2 0 amino acids containing a reactive side group such as 

lysine. 

Using a solid phase synthesis as exemplary, the 
protected or derivatized amino acid is attached to an 
inert solid support through its unprotected carboxyl 
25 or amino group. The protecting group of the amino or 

carboxyl group is then selectively removed and the 
next amino acid in the sequence having the 
complimentary (amino or carboxyl) group suitably 
protected is admixed and reacted under conditions 

3 0 suitable for forming the amide linkage with the 

residue already attached to the solid support. The 
protecting group of the amino or carboxyl group is 
then removed from this newly added amino acid residue, 
and the next amino acid (suitably protected) is then 
3 5 added, and so forth. After all the desired amino 

acids have been linked in the proper sequence, any 
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remaining terminal and side group protecting groups 
(and solid support) are removed sequentially or 
concurrently, to afford the final polypeptide. 

An a 6B or a 3B -derived polypeptide can be used, 
inter alia , in the diagnostic methods and systems of 
the present invention to detect a 6B or a 3B present in a 
body sample, or can be used to prepare an inoculum as 
described herein for the preparation of antibodies 
that immunoreact with epitopes on the cytoplasmic 
domain of either a 6B or a 3B . 

C. DNA Segments 
In living organisms, the amino acid residue 
sequence of a protein or polypeptide is directly 
related via the genetic code to the deoxyribonucleic 
acid (DNA) sequence of the structural gene that codes 
for the protein. Thus, a structural gene can be 
defined in terms of the amino acid residue sequence, 
i.e., protein or polypeptide, for which it codes. 

An important and well known feature of the 
genetic code is its redundancy. That is, for most of 
the amino acids used to make proteins, more than one 
coding nucleotide triplet (codon) can code for or 
designate a particular amino acid residue. Therefore, 
a number of different nucleotide sequences may code 
for a particular amino acid residue sequence. Such 
nucleotide sequences are considered functionally 
equivalent since they can result in the production of 
the same amino acid residue sequence in ail organisms. 
Occasionally, a methylated variant of a purine or 
pyrimidine may be incorporated into a given nucleotide 
sequence. However, such methylations do not affect 
the coding relationship in any way. 

In one embodiment the present invention 
contemplates an isolated. DNA segment that comprises a 
nucleotide base sequence that encodes a polypeptide 



that includes the amino acid residue sequence defining 
the cytoplasmic domain of a 6B or a 3B as defined herein, 
A DNA segment therefor has a nucleotide sequence 
encoding the human or mouse a 6B or mouse a 3B proteins, 
or at least encoding the cytoplasmic domain of those 
proteins. The nucleotide sequences are generally 
shown in SEQ ID NO 4 for human a 6B/ NO 6 for mouse a 6B 
and NO 10 for mouse a 3B . 

Preferred DNA segments include a nucleotide base 
sequence represented by the base sequence contained in 
SEQ ID NO 4 from base 3279 to base 3418 and defining a 
coding sequence that translates into the cytoplasmic 
domain of a 6B . Particularly preferred is a nucleotide 
base sequence represented by the sequence contained in 
SEQ ID NO 4 from base 147 to base 3418 that defines 
the a 6B integrin subunit. Corresponding nucleotide 
sequences for mouse a 6B in SEQ ID NO 6 are also 
contemplated. 

In another embodiment, preferred DNA segments 
include a nucleotide base sequence represented by the 
base sequence contained in SEQ ID NO 10 from base 339 
to base 4 63 and defining a coding sequence that 
translates into the cytoplasmic domain of a 3B . 
Particularly preferred is a nucleotide base sequence 
represented by the sequence contained in SEQ ID NO 10 
from base 1 to base 463 that defines the carboxy 
terminal portion of the a 3B integrin subunit, 
including the cytoplasmic domain of a 3B . 

In preferred embodiments, the length of the 
nucleotide base sequence is no more than about 3,000 
bases, preferably no more than about 1,000 bases. 

A purified DNA segment of this invention is 
substantially free of other nucleic acids that do not 
contain the nucleotide base sequences specified herein 
for a DNA segment of this invention, whether the DNA 
segment is present in the form of a composition 



WO 92/19647 



PCT/US92/03527 



24 



containing the purified DNA segment, or as a solution 
suspension or particulate formulation. By 
substantially free is means that the DNA segment is 
present as at least 10% of the total nucleic acid 
5 present by weight, preferably greater than 50%, and 

more preferably greater than 90% of the total nucleic 
acid by weight. 

In preferred embodiments, a DNA segment of the 
present invention is bound to a complementary DNA 
10 segment, thereby forming a double stranded DNA 

segment. In addition, it should be noted that a 
double stranded DNA segment of this invention 
preferably has a single stranded cohesive tail at one 
or both of its termini. 
!5 a DNA segment of the present invention can easily 

be synthesized by chemical techniques, for example, 
the phosphotriester method of Matteucci et al., »T . Am.., 
rhem. Soc . 103:3185 (1981). (The disclosures of the 
art cited herein are " incorporated herein by 
20 reference.) Of course, by chemically synthesizing the 

structural gene portion, any desired modifications can 
be made simply by substituting the appropriate bases 
for those encoding a native amino acid residue. 

In addition, a DNA segment can be prepared by 
25 first synthesizing oligonucleotides that correspond to 

portions of the DNA segment, which oligonucleotides 
are then assembled by hybridization and ligation into 
a complete DNA segment. Such methods are also well 
known in the art. See for example, Paterson et al. , 
30 cell , 48:441-452 (1987); and Lindley et al. , Proc. 

Natl. Acad. Sci. , 85:9199-9203 (1988), where a 
recombinant peptide, neutrophil-activated factor, was 
produced from the expression of a chemically 
synthesized gene in E. coli. 
35 a DNA expression vector of the present invention 

is a recombinant DNA (rDNA) molecule adapted for 



receiving and expressing translatable DNA sequences in 
the form of a fusion polypeptide of this invention. A 
DNA expression vector is characterized as being 
capable of expressing, in a compatible host, a 
structural gene product such as an a 6B or a 3B 
polypeptide of the present invention. 

As used herein, the term "vector 11 refers to a 
nucleic acid molecule capable of transporting between 
different genetic environments another nucleic acid to 
which it has been operatively linked. Preferred 
vectors are those capable of autonomous replication 
and expression of structural gene products present in 
the DNA segments to which they are operatively linked. 

As used herein, the term "operatively linked", in 
reference to DNA segments, describes that the 
nucleotide sequence is joined to the vector so that 
the sequence is under the transcriptional and/or 
translation control of the expression vector and can 
be expressed in a suitable host cell. 

The choice of vector to which a structural gene 
is operatively linked depends directly, as is well 
known in the art, on the functional properties 
desired, e.g., vector replication and protein 
expression, and the host cell to be transformed, these 
being limitations inherent in the art of constructing 
recombinant DNA molecules. 

In preferred embodiments, the vector utilized 
includes a prokaryotic replicon i.e., a DNA sequence 
having the ability to direct autonomous replication 
and maintenance of the recombinant DNA molecule extra 
chromosomally in a prokaryotic host cell, such as a 
bacterial host cell, transformed therewith. Such 
replicons are well known in the art. In addition, 
those embodiments that include a prokaryotic replicon 
also include a gene whose expression confers a 
selective advantage, such as drug resistance, to a 



bacterial host transformed therewith. Typical 
bacterial drug resistance genes are those that confer 
resistance to ampicillin or tetracycline. Vectors 
typically aiso contain convenient restriction sites 
for insertion : of translatable DNA sequences. Exemplary 
vectors are the plasmids pUC8, pUC9, pBR3 22, and 
pBR3 29 available from BioRad Laboratories , (Richmond, 
CA) and pPL and pKK223 available from Pharmacia, 
(Piscataway f NJ) . Also contemplated are vectors for 
expressing a DNA segment of this invention in a yeast 
or mammalian host cell. 

DNA expression control sequences include both 5 1 
and 3" elements, as is well known, to form a cistron 
able to express a structural gene product. The 5 1 
control sequences define a promoter for initiating 
transcription and a ribosome binding site operatively 
linked at the 5 1 terminus of the upstream translatable 
DNA sequence. 

To achieve high levels of gene expression in E. 
coli, it is necessary to use not only strong promoters 
to generate large quantities of mRNA, but also 
ribosome binding sites to ensure that the mRNA is 
efficiently translated. In E. coli, the ribosome 
binding site includes an initiation codon (AUG) and a 
sequence 3-9 nucleotides long located 3-11 nucleotides 
upstream from the initiation codon [Shine et al. , 
Nature, 254:34 (1975). The sequence, AGGAGGU, which 
is called the Shine-Da lgarno (SD) sequence, is 
complementary to the 3' end of E. coli 16S mRNA. 
Binding of the ribosome to mRNA and the sequence at 
the 3 1 end of the mRNA can be affected by several 
factors : 

(i) The degree of complementarity between the SD 
sequence and 3 1 end of the 16 S tRNA. 

(ii) The spacing and possibly the DNA sequence 
lying between the SD sequence and the AUG [Roberts et 
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al. , Proc. Natl, Acad. ScL USA f 76:760 (1979a); 
Roberts et al., Proc. Natl. Acad. Sci. USA r 76:5596 
(1979b); Guarente et al., Science . 209:1428 (1980); 
and Guarente et al., Cell . 20 t 54 3 (1980).] 
5 Optimization is achieved by measuring the level of 

expression of genes in plasmids in which this spacing 
is systematically altered. Comparison of different 
mRNAs shows that there are statistically preferred 
sequences from positions -2 0 to +13 (where the A of 

10 the AUG is position 0) [Gold et al., Annu. Rev. 

Microbiol . , 35:365 (1981)]. Leader sequences have 
been shown to influence translation dramatically 
(Roberts et al., 1979 a, b supra ) . 

(iii) The nucleotide sequence following the AUG, 

15 which affects ribosome binding [Taniguchi et al., J. 

Mol. Biol. , 118:533 (1978)]. 

D. Antibodies and Monoclonal Antibodies 
The term "antibody" in its various grammatical 
2 0 forms is used herein as a collective noun that refers 

to a population of immunoglobulin molecules and/or 
immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antibody 
combining site or paratope. 
25 An "antibody combining site" is that structural 

portion of an antibody molecule comprised of heavy and 
light chain variable and hypervariable regions that 
specifically binds (immunoreacts with) antigen. The 
term immunoreact in its various forms means specific 
3 0 binding between an antigenic determinant-containing 

molecule and a molecule containing an antibody 
combining site such as a whole antibody molecule or a 
portion thereof. 

The phrase "antibody molecule" in its various 
35 grammatical forms as used herein contemplates both an 
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intact immunoglobulin molecule and an immunologically 
active portion of an immunoglobulin molecule. 

Exemplary antibody molecules for use in the 
diagnostic methods and systems of the present 
5 invention are intact immunoglobulin molecules, 

substantially intact immunoglobulin molecules and 
those portions of an immunoglobulin molecule that 
contain the paratope, including those portions known 
in the art as Fab, Fab 1 , F(ab«) 2 and F(v) . 

10 Fab and F(ab f ) 2 portions of antibodies are 

prepared by the proteolytic reaction of papain and 
pepsin, respectively, on substantially intact 
antibodies by methods that are well known. See for 
example, U.S. Patent No. 4,342,566 to Theof ilopolous 

15 and Dixon. Fab 1 antibody portions are also well known 

and are produced from F(ab')2 portions followed by 
reduction of the disulfide bonds linking the two heavy 
reduction of the disulfide bonds linking the two heavy 
chain portions as with mercaptoethanol, and followed 

20 by alkylation of the resulting protein mercaptan with 

a reagent such as iodoacetamide. An antibody 
containing intact antibody molecules are preferred, 
and are utilized as illustrative herein. 

An antibody of the present invention in one 

25 embodiment is an anti-cytoplasmic a 6B domain antibody 

characterized as being capabie of immunoreacting with 
1) human a 6B , and 2) a polypeptide having a sequence 
shown in SEQ ID NO 3 from residue 1045 to residue 
1091. 

30 In another embodiment an antibody of this 

invention is an anti-cytoplasmic a 6B domain antibody 
characterized as being capable of immunoreacting with 
1) human a 6B , and 2) a polypeptide having a sequence 
shown in SEQ ID NO 3 from residue 1068-1091. 

35 in another embodiment an antibody of this 

invention is an anti-cytoplasmic a 6B domain antibody 
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characterized as being capable of immunoreacting with 
1) mouse cr 6B and 2) a polypeptide having a sequence 
shown in SEQ ID NO 5 from residue 121 to residue 141. 
In another embodiment, an anti-cytoplasmic cr 3B 
5 domain antibody is contemplated that is characterized 

as being capable of immunoreacting with 1) mouse cr 3B , 
and 2) the polypeptide having a sequence shown in SEQ 
. ID NO 9 from residue 113 to residue 153. 

Antibody immunoreactivity with antigens 
10 containing a cytoplasmic domain as defined above can 

be measured by a variety of immunological assays known 
in the art. Exemplary immunoreaction of a subject 
antibody with a 6B or a 3B polypeptides is described in 
Examples 2 and 4 . 
15 For example, immunoreaction with whole protein 

can be measured by the immunoprecipitation procedures 
described in Example 2. Immunoreaction of antibodies 
with polypeptides can be conveniently measured using 
ELISA as described in U.S. Patents No. 3/643,090; No. 
20 3,850,752; or No. 4,016,043, which are incorporated 

herein by reference, using the polypeptide in the 
solid phase, as is well know. 

An antibody of the present invention is typically 
produced by immunizing a mammal with an inoculum 
25 containing a polypeptide of this invention and thereby 

induce in the mammal antibody molecules having 
immunospecif icity for the polypeptide. Exemplary 
immunization procedures for preparing an antibody of 
this invention are described in Example 2. The 
3 0 antibody molecules are then collected from the mammal 

and isolated to the extent desired by well known 
techniques such as, for example, by using DEAE 
Sephadex to obtain the IgG fraction. 

To enhance the specificity of the antibody, the 
35 antibodies may be purified by immunoaf f inity 

chromatography using solid phase-affixed immunizing 
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polypeptide. The antibody is contacted with the solid 
phase-affixed immunizing polypeptide for a period of 
time sufficient for the polypeptide to immunoreact 
with the antibody molecules to form a solid phase- 
5 affixed immuno complex. The bound antibodies are 

separated from the complex by standard techniques. 

The antibody so produced can be used, inter alia, 
in the diagnostic methods and systems of the present 
invention to detect a 6B or a 3B subunits present in a 
10 body sample. See, for example, the methods described 

in Examples 2 and 4. 

The word "inoculum" in its various grammatical 
forms is used herein to describe a composition 
containing a polypeptide of this invention as an 
15 active ingredient used for the preparation of 

antibodies against the cytoplasmic domain of an a 6B or 
a 3B polypeptide. When a polypeptide is used in an 
inoculum to induce antibodies it is to be understood 
that the polypeptide can be used in various 
2 0 embodiments, e.g., alone or linked to a carrier as a 

conjugate, or as a polypeptide polymer. However, for 
ease of expression and in context of a polypeptide 
inoculum, the various embodiments of the polypeptides 
of this invention are collectively referred to herein 
25 by the term "polypeptide", and its various grammatical 

forms. 

For a polypeptide that contains fewer than about 
35 amino acid residues, it is preferable to use the 
peptide bound to a carrier for the purpose of inducing 

30 the production of antibodies. 

One or more additional amino acid residues can be 
added to the amino- or carboxy-termini of the 
polypeptide to assist in binding the polypeptide to a 
carrier. Cysteine residues added at the amino- or 

35 carboxy-termini of the polypeptide have been found to 

be particularly useful for forming conjugates via 



disulfide bonds* However, other methods well known in 
the art for preparing conjugates can also be used. 
Exemplary additional linking procedures include the 
use of Michael addition reaction products, di- 
aldehydes such as glutaraldehyde, Klipstein, et al., 
J. Infect. Dis., 147:318-326 (1983) and the like, or 
the use of carbodiimide technology as in the use of a 
water-soluble carbodiimide to form amide links to the 
carrier. For a review of protein conjugation or 
coupling through • activated functional groups, see 
Aurameas, et al., Scand. J. Immunol., 1:7-23 (1978). 

Useful carriers are well known in the art, and 
are generally proteins themselves. Exemplary of such 
carriers are keyhole limpet hemocyanin (KLH) , edestin, 
thyroglobulin, albumins such as bovine serum albumin . 
(BSA) or human serum albumin (HSA) , red blood cells 
such as sheep erythrocytes (SRBC) , tetanus toxoid, 
cholera toxoid as well as polyamino acids such as poly 
(D- lysine: D-glutamic acid), and the like. 

The choice of carrier is more dependent upon the 
ultimate use of the inoculum and is based upon 
criteria not particularly involved in the present 
invention. For example, a carrier that does not 
generate an untoward reaction in the particular animal 
to be inoculated should be selected. 

The present inoculum contains an effective, 
immunogenic amount of a polypeptide of this invention, 
typically as a conjugate linked to a carrier. The 
effective amount of polypeptide per unit dose 
sufficient to induce an immune response to the 
immunizing polypeptide depends, among other things, on 
the species of animal inoculated, the body weight of 
the animal and the chosen inoculation regimen as is 
well known in the art. Inocula typically contain 
polypeptide concentrations of about 10 micrograms to 
about 500 milligrams per inoculation (dose) , 



preferably about 50 micrograms to about 50 milligrams 
per dose. 

The term "unit dose" as it pertains to the 
inocula refers to physically discrete units suitable 
as unitary dosages for animals, each unit containing a 
predetermined quantity of active material calculated 
to produce the desired immunogenic effect in 
association with the required diluent; i.e., carrier, 
or vehicle. The specifications for the novel unit 
dose of an inoculum of this invention are dictated by 
and are directly dependent on (a) the unique 
characteristics of the active material and the 
particular immunologic effect to be achieved, and (b) 
the limitations inherent in the art of compounding 
such active material for immunologic use in animals, 
as disclosed in detail herein, these being features of 
the present invention. 

Inocula are typically prepared from the dried 
solid polypeptide-coft jugate by dispersing the 
polypeptide-con jugate in a physiologically tolerable 
(acceptable) diluent such as water, saline or 
phosphate-buffered saline to form an aqueous 
comp o s it i on . 

Inocula can also include an adjuvant as part of 
the diluent. Adjuvants such as complete Freund's 
adjuvant (CFA) , incomplete Freund's adjuvant (IFA) and 
alum are materials well known in the art, and are 
available commercially from several sources. 

The techniques of polypeptide conjugation or 
coupling through activated functional groups presently 
known in the art are particularly applicable. See, 
for example, Aurameas, et al., Scand. «T . Immunol. . 
Vol. 8, Suppl. 7:7-23 (1978) and U.S. Patent No. 
4,493,795, No. 3,791,932 and No. 3,839,153. In 
addition, a site directed coupling reaction can be 
carried out so that any loss of activity due to 
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polypeptide orientation after coupling can be 
minimized. See, for example, Rodwell et al. , 
Biotech. , 3 : 889-894 (1985) , and U.S. Patent No. 
4, 671, 958. 

5 One or more additional amino acid residues may be 

added to the amino- or carboxy-termini of the 
polypeptide to assist in binding the polypeptide to 
form a conjugate. Cysteine residues, usually added at 
the carboxy-terminus of the polypeptide, have been 
10 found to be particularly useful for forming con jugates 

via disulfide bonds, but other methods well-known in 
the art for preparing conjugates may be used. 

A preferred antibody of this invention is a 
monoclonal antibody. 
15 The phrase "monoclonal antibody" in its various 

grammatical forms refers to a population of antibody 
molecules that contain only one species of antibody 
combining site capable of immunoreacting with a 
particular epitope. A monoclonal antibody thus 
20 typically displays a single binding affinity for any 

epitope with which it immunoreacts. A monoclonal 
antibody may therefore contain an antibody molecule 
having a plurality of antibody combining sites, each 
immunospecific for a different epitope, e.g., a 
25 bispecific monoclonal antibody . 

A monoclonal antibody is typically composed of 
antibodies produced by clones of a single cell called 
a hybridoma that secretes (produces) but one kind of 
antibody molecule. The hybridoma cell is formed by 
30 fusing an antibody-producing cell and a myeloma or 

other self -perpetuating cell line. The preparation of 
such antibodies was first described by Kohler and 
Milstein, Nature 256:495-497 (1975), which description 
is incorporated by reference. The hybridoma 
35 supernates so prepared can be screened for the 

presence of antibody molecules that immunoreact with a 
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polypeptide of this invention, or for inhibition of 
the natural function of an a 6B or a 3B subunit. 

Briefly, to form the hybridoma from which the 
monoclonal antibody composition is produced, a myeloma 
5 or other self-perpetuating cell line is fused with 

lymphocytes obtained from the spleen of a mammal 
hyperimmunized with an antigen containing the 
cytoplasmic domain of a a or a 3B , such as is present in 
a polypeptide of this invention. The polypeptide- 
10 induced hybridoma technology is described by Niman et 

al., Proc. Nat3- Sci. . U.S.A. . 80:4949-4953 (1983), 
which description is incorporated herein by reference. 

It is preferred that the myeloma cell line used 
to prepare a hybridoma be from the same species as the 
15 lymphocytes. Typically, a mouse of the strain 129 

G1X + is the preferred mammal. Suitable mouse myelomas 
for use in the present invention include the 
hypoxanthine-aminopterin-thymidine-sensitive (HAT) 
cell lines P3X63-Ag8'. 653 , and Sp2/0-Agl4 that are 
20 available from the American Type Culture Collection, 

Rockville, MD, under the designations CRL 158 0 and CKL 
1581, respectively. 

Splenocytes are typically fused with myeloma 
cells using polyethylene glycol (PEG) 1500. Fused 
25 hybrids are selected by their sensitivity to HAT. 

Hybridomas producing a monoclonal antibody of this 
invention are identified using an immunoassay such as 
the immunoprecipitation protocol described in Example 
3. 

30 a monoclonal antibody of the present invention 

can also be produced by initiating a monoclonal 
hybridoma culture comprising a nutrient medium 
containing a hybridoma that secretes antibody 
molecules of the appropriate polypeptide specificity. 

35 The culture is maintained under conditions and for a 

time period sufficient for the hybridoma to secrete 



the antibody molecules into the medium. The antibody- 
containing medium is then collected. The antibody 
molecules can then be further isolated by well known 
techniques. 

Media useful for the preparation of these 
compositions are both well known in the art and 
commercially available and include synthetic culture 
media, inbred mice and the like. An exemplary 
synthetic medium is Dulbecco's minimal essential 
medium (DMEM; Dulbecco et al., Virol. 8:396 (1959)) 
supplemented with 4.5 gm/1 glucose, 20 mm glutamine, 
and 2 0% fetal calf serum. An exemplary inbred mouse 
strain is the Balb/c. 

The monoclonal antibodies of this invention can 
be used in the same manner as disclosed herein for 
antibodies of the present invention. 

For example, the monoclonal antibody can be used 
in the diagnostic methods and systems disclosed herein 
where formation of a 'cytoplasmic a 6B or a 3B domain- 
containing immunoreaction product is desired. 

Other methods of producing a monoclonal 
antibody, a hybridoma cell, or a hybridoma cell 
culture are also well known. See, for example, the 
method of isolating monoclonal antibodies from an 
immunological repertoire as described by Sastry, et 
al., Proc. Natl. Aca d. • Sci. . 86:5728-5732 (1989); Huse 
et al., Science, 246:1275-1281 (1989); and Mullinax et 
al, Proc « Natl . Acad . Sci . USA . 87 :8095-8099 (1990) . 

Also contemplated by this invention is the. 
hybridoma cell, and cultures containing a hybridoma 
cell that produce a monoclonal antibody of this 
invention. 

D. Diagnostic Systems 

The present invention also describes . a diagnostic 
system, preferably in kit form, for assaying for the 
presence of antigen having the cytoplasmic domain of 



a 6B or a 3B in a body sample such as a tissue, body 
fluid or the like body sample. A diagnostic system 
includes , in an amount sufficient for at. least one 
assay, a subject polypeptide and/or a subject antibody 
or monoclonal antibody, as a separately packaged 
immunochemical reagent. Instructions for use of the 
packaged reagent are also typically included. 

As used herein, the term "package" refers to a 
solid matrix or material such as glass, plastic, 
paper, foil and the like capable of holding within 
fixed limits a polypeptide , polyclonal antibody or 
monoclonal antibody of the present invention. Thus, 
for example, a package can be a glass vial used to 
contain milligram quantities of a contemplated 
polypeptide or it can be a microtiter plate well to 
which microgram quantities of a contemplated 
polypeptide have been operatively affixed, i.e., 
linked so as to be capable of being immunologically 
bound by an antibody. 

"Instructions for use" typically include a 
tangible expression describing the reagent 
concentration or at least one assay method parameter 
such as the relative amounts of reagent and sample to 
be admixed, maintenance time periods for 
reagent/sample admixtures, temperature, buffer 
conditions and the like. 

In one embodiment, a diagnostic system for 
assaying for the presence of or to quantitate a 6B or 
a 3B in a sample, such as blood, plasma or serum, 
comprises a package containing at least one a 6B or a 38 
derived polypeptide of this invention depending on 
whether a 6B or a 3B is to be detected, respectively. In 
another embodiment, a diagnostic system of the present 
invention for assaying for the presence or amount of 
or a 3B in a sample further includes an antibody 



composition of this invention. An exemplary 
diagnostic system is described in Example 4. 

In preferred embodiments, a diagnostic system of 
the present invention further includes a label or 
indicating means capable of signaling the formation of 
an immunocomplex containing a polypeptide or antibody 
molecule of the present invention. 

The word "complex" as used herein refers to the 
product of a specific binding reaction such as an 
antibody-antigen or receptor-ligand reaction. 
Exemplary complexes are immunoreaction products. 

As used herein, the terms "label" and "indicating 
means" in their various grammatical forms refer to 
single atoms and molecules that are either directly or 
indirectly involved in the production of a detectable 
signal to indicate the presence of a complex. Any 
label or indicating means can be linked to or 
incorporated in an expressed protein, polypeptide, or 
antibody molecule that is part of an antibody or 
monoclonal antibody composition of the present 
invention, or used separately, and those atoms or 
molecules can be used alone or in conjunction with 
additional reagents. Such labels are themselves well- 
known in clinical diagnostic chemistry and constitute 
a part of this invention only insofar as they are 
utilized with otherwise novel proteins methods and/or 
systems. 

The labeling means can be a fluorescent labeling 
agent that chemically binds to antibodies or antigens 
without denaturing them to form a fluorochrome (dye) 
that is a useful immunof luorescent tracer. Suitable 
fluorescent labeling agents are f luorochromes such as 
fluorescein isocyanate (FIC) , fluorescein 
isothiocyante (FITC) , 5-dimethylamine-l- 
naphthalenesulf onyl chloride (DANSC) , 
tetramethylrhodamine isothiocyanate (TRITC) , 
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lissamine, rhodaraine 82 00 sulphonyl chloride (RB 200 
SC) and the like. A description of immunofluorescence 
analysis techniques is found in DeLuca, 
•'Immunofluorescence Analysis", in Antibody. As a Tool, 
5 Marchalonis, et al. , eds. , John Wiley & Sons, Ltd. , 

pp. 189-231 (1982), which is incorporated herein by 
reference. 

In preferred embodiments, the indicating group is 
an enzyme, such as horseradish peroxidase (HRP) , 

10 glucose oxidase, or the like. In such cases where the 

principal indicating group is an enzyme such as KRP or 
glucose oxidase, additional reagents are required to 
visualize the fact that a receptor-ligand complex 
(immunoreactant) has formed. Such additional reagents 

15 for HEP include hydrogen peroxide and an oxidation dye 

precursor such as diaminobenzidine. An additional 
reagent useful with glucose oxidase is 2 , 2 ■ -azino-di- 
(3-ethyl-benzthiazoline-G-sulfonic acid) (ABTS) . 

Radioactive elements are also useful labeling 

20 agents and are used illustratively herein. An 

exemplary radiolabeling agent is a radioactive element 
that produces gamma ray emissions. Elements which 

, 12*-. 125 -r 128 x 132 T 

themselves emit gamma rays, such as I, x, J- 
and 51 Cr represent one class of gamma ray emission- 

25 producing radioactive element indicating groups. 

Particularly preferred is 125 I. Another group of 
useful labeling means are those elements such as 1X C, 
18 F, 15 0 and 13 N which themselves emit positrons. The 
positrons so emitted produce gamma rays upon 

30 encounters with electrons present in the animal's 

body. Also useful is a beta emitter, such 111 indium 
of 3 H. 

The linking of labels, i.e., labeling of, 
polypeptides and proteins is well known in the art. 
35 For instance, antibody molecules produced by a 

hybridoma can be labeled by metabolic incorporation of 



radioisotope-containing amino acids provided as a 
component in the culture medium. See, for example, 
Galfre et al., Meth. Enzvmol. . 73:3-46 (1981). The 
techniques of protein conjugation or coupling through 
activated functional groups are particularly 
applicable. See, for example, Aurameas, et al., 
Scand. J. Immunol. , Vol. 8 Suppl. 7:7-23 (1978), 
Rodwell et al., Biotech. . 3:889-894 (1984), and U.S. 
Pat. No. 4,493,795. 

The diagnostic systems can also include, 
preferably as a separate package, a specific binding 
agent. A "specific binding agent" is a molecular 
entity capable of selectively binding a reagent 
species of the present invention or a complex 
containing such a species, but is not itself a 
polypeptide or antibody molecule composition of the 
present invention. Exemplary specif ic binding agents 
are second antibody molecules, complement proteins or 
fragments thereof, S.' aureus protein A, and the like. 
Preferably the specific binding agent binds the 
reagent species when that species is present as part 
of a complex. 

In preferred embodiments, the specific binding 
agent is labeled. However, when the diagnostic system 
includes a specific binding agent that is not labeled, 
the agent is typically used as an amplifying means or 
reagent. In these embodiments, the labeled specific 
binding agent is capable of specif ically binding the 
amplifying means when the amplifying means is bound to 
a reagent species-containing complex. 

The diagnostic kits of the present invention can 
be used in an "ELISA" format to detect the quantity of 
a 6B or ct 3B subunit in a vascular fluid sample such as 
blood, serum, or plasma. "ELISA" refers to an enzyme- 
linked immunosorbent assay that employs an antibody or 
antigen bound to a solid phase and an enzyme-antigen 
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or enzyme-antibody conjugate to detect and quantify 
the amount of an antigen present in a sample. A 
description of the ELISA technique is found in Chapter 
22 of the 4th Edition of Basic and Clinical Immunolocrv 
5 by D.P. Sites et al. , published by Lange Medical 

Publications of Los Altos , CA in 1982 and in U.S. 
Patents No. 3,654,090; No. 3,850,752; and No. 
4,016,043, which are all incorporated herein by 
reference. 

10 Thus, in preferred embodiments, a polypeptide or 

an antibody of the present invention can be affixed to 
a solid matrix to form a solid support that comprises 
a package in the subject diagnostic systems. 

A reagent is typically affixed to a solid matrix 

15 by adsorption from an aqueous medium although other 

modes of affixation applicable to proteins and 
polypeptides well known to those skilled in the art, 
can be used. 

Useful solid matrices are also well known in the 

20 art. Such materials are water insoluble and include 

the cross-linked dextran available under the trademark 
SEPHADEX from Pharmacia Fine Chemicals (Piscataway, 
NJJ ; agarose; beads of polystyrene beads about 1 
micron to about 5 millimeters in diameter available 

25 from Abbott Laboratories of North Chicago, IL; 

polyvinyl chloride, polystyrene, cross-linked 
polyacrylamide, nitrocellulose- or nylon-based webs 
such as sheets, strips or paddles; or tubes, plates or 
the wells of a microtiter plate such as those made 

3 0 from polystyrene or polyvinylchloride. 

The reagent species, labeled specific binding 
agent or amplifying reagent of any diagnostic system 
described herein can be provided in solution, as a 
liquid dispersion or as a substantially dry power, 

35 e.g., in lyophilized form. Where the indicating means 

is an enzyme, the enzyme's substrate can also be 



provided in a separate package of a system. A solid 
support such as the before-described microtiter plate 
and one or more buffers can also be included as 
separately packaged elements in this diagnostic assay 
system. 

The packaging materials discussed herein in 
relation to diagnostic systems are those customarily 
utilized in diagnostic systems. 

The term "package 11 refers to a solid matrix or 
material such as glass, plastic (e.g., polyethylene, 
polypropylene and polycarbonate) , paper, foil and the 
like capable of holding within fixed limits a 
diagnostic reagent such as a polypeptide, antibody or 
monoclonal antibody of the present invention. Thus, 
for example, a package can be a bottle, vial, plastic 
and plastic-foil laminated envelope or the like 
container used to contain a contemplated diagnostic 
reagent or it can be a microtiter plate well to which 
microgram quantities of a contemplated diagnostic 
reagent have been operatively affixed, i.e., linked so 
as to be capable of being immunologically bound by an 
antibody or polypeptide to be detected. 

F. Assay Methods 

The present invention contemplates various 
immunoassay methods for determining the amount of a 6B 
or a 3B in a biological sample using a polypeptide, 
polyclonal antibody or monoclonal antibody of this 
invention as an immunochemical reagent to form an 
immunoreact ion product whose amount relates, either 
directly or indirectly, to the amount of . a 66 or a 3B in 
the sample. 

Those skilled in the art will understand that 
there are numerous well known clinical diagnostic 
chemistry procedures in which an immunochemical 
reagent of this invention can be used to form an 
immunoreaction product whose amount relates to the 



amount of a 6B or a 3B present in a body sample. Thus, 
while exemplary assay methods are described herein, 
the invention is not so limited. 

Various heterogeneous and homogeneous protocols, 
either competitive or noncompetitive, can be employed 
in performing an assay method of this invention, 
including radioimmunoprecipitation (RIP) / solid phase 
immunoassay such as ELISA, in situ immunoreaction 
assays for direct binding of antigen in tissue 
samples, and the like immunoassay protocols. 

Generally, to detect the presence of an or 6B or 
cr 3B subunit or polypeptide in a patient, an aliquot 
(i.e., a predetermined amount) of a body fluid sample, 
such as urine or a vascular fluid, namely blood, 
plasma or serum from the patient, or a tissue sample 
prepared for immunoreaction, is contacted by admixture 
(admixed) , with an antibody composition of the present 
invention to form an immunoreaction admixture. The 
admixture is then maintained under biological assay 
conditions (immunoreaction conditions) for a period of 
time sufficient for the a 6B or a 3B antigen present in 
the sample to immunoreact with (immunologically bind) 
a portion of the antibody combining sites present in 
the antibody composition to form a antigen-antibody 
molecule immunoreaction product (immunocomplex) . The 
complex can then be detected as described herein. The 
presence of the complex is indicative of or 6B or a 3B 
subunit or polypeptide in the sample. 

Maintenance time periods sufficient for 
immunoreaction are well known and are typically from 
about 10 minutes to about 16-2 0 hours at a temperature 
of about 4°C to about 45 C C, with the time and 
temperature typically being inversely related. For 
example, longer maintenance times are utilized at 
lower temperatures, such as 16 hours at 4°C, and 
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shorter times for higher temperatures, such as 1 hour 
at room temperature. 

Biological assay conditions- are those that 
maintain the biological activity of the immunochemical 
5 reagents of this invention and the or 6B or a 3B subunit 

or polypeptide sought to be assayed such that the 
reagents retain their ability to form an 
immunoreaction product. Those conditions include a 
temperature range of about 4°C to about 45 °C, a pH 
10 value of about 5 to about 9 and an ionic strength 

varying from that of distilled water to that of about 
one molar sodium chloride. Methods for optimizing 
such maintenance time periods and biological assay 
conditions are well known in the art. 
15 Determining the presence or amount of an a 6B or 

a 3B containing immunoreaction product formed by the 
above maintenance step, either directly or indirectly, 
can be accomplished by assay techniques well known in 
the art, and typically depend on the type of 
20 indicating means used. 

In a direct binding assay format for detecting 
a 6B or a 3B in a tissue sample such as a tissue section, 
the antibody is reacted with the target antigen in 
situ to form the immunoreaction complex, thereafter, 
25 the immunocomplex is detected thereby indicating the 

presence of the antigen in the tissue. Exemplary and 
preferred in situ immunoassay formats are described in 
Example 4. Alternatively , the direct binding assay 
can be practiced with a fluid body sample believed to 
30 contain a 6B or a 3B subunits or polypeptides. 

Thus, in this embodiment, the direct assay 
comprises the steps of : 

(a) admixing a tissue sample or body fluid 
sample with an antibody composition of this invention 
35 immunospecif ic for a cytoplasmic domain of either a 6B 



or a 3B as described herein to form an immunoreaction 
admixture ; 

(b) maintaining said immunoreaction admixture 
under biological assay conditions for a time period 
sufficient to form an immunoreaction product; and 

(c) detecting the presence, and preferably 
amount , of the immunoreaction product formed phase in 
step (b) , and thereby the amount of presence/amount of 
a 6B or oc 3B in the sample. 

More preferably, detecting in step (c) is 
performed by the steps of: 

(i) admixing the immunoreaction product formed 
in step (b) with an indicating means to form a second 
reaction admixture; 

(ii) maintaining the second reaction admixture 
for a time period sufficient for said indication means 
to bind the immunoreaction product formed in step (b) 
and form a second reaction product; and, 

(iii) determining the presence and/or amount of 
indicating means in the second reaction product, and 
thereby the presence of the immunoreaction product 
formed in step (b) • Particularly preferred is the use 
of a labeled second antibody, immunospecif ic for the 
first antibody, as the indicating means, and 
preferably the label is horseradish peroxidase. In 
one embodiment, it is particularly preferred to use 
(1) mouse anti-cytoplasmic domain a 6B polypeptide 
antibody in the antibody composition, and (2) goat 
anti-mouse IgG antibodies labeled with horseradish 
peroxidase as the indicating means* 

In a preferred competition assay method, the 
immunoreaction admixture described above further 
contains a solid phase having affixed thereto a solid 
phase antigen comprising an a 6B or a 3B subunit or 
polypeptide having an amino acid residue sequence that 
includes the cytoplasmic domain of a 6B or a 3B , 



respectively, of this invention. Thus, in this 
embodiment, the assay comprises the steps of: 

(a) admixing a body fluid sample with 1) an 
antibody composition of this invention and 2) a solid 
support having affixed thereto (operatively linked) an 
antigen comprising an a 6B or a 3B subunit or polypeptide 
having an amino acid residue sequence that includes 
the cytoplasmic domain of a 6B or a 3B of this invention, 
or both, to form an immunoreaction admixture having 
both a liquid phase and a solid phase; 

(b) maintaining said immunoreaction admixture 
under biological assay conditions for a time period 
sufficient to form an immunoreaction product in the 
solid phase; and 

(c) detecting the presence, and preferably 
amount, of the immunoreaction product formed in the 
solid phase in step (b) , and thereby the amount of 
presence/amount of one or both of a 6B and a 3B in the 
body fluid sample. 

In another competition assay format the 
immunoreaction admixture contains (1) a body fluid 
sample, preferably cell free, (2) an antibody of this 
invention and (3) a labeled antigen comprising the 
cytoplasmic domain of a 6B or a 3B , wherein the antibody 
is present in the solid phase, being affixed to a 
solid support, to form a liquid and a solid phase. In 
this embodiment, the admixed body fluid sample 
competes with the labeled reagent for immunoreaction 
with the solid phase antibody to form a solid phase 
immunoreaction product. Thereafter, the detection of 
label in the solid phase correlates with the amount of 
a 6B or or 3B in the admixed fluid sample. 

In one embodiment, the detection of a polypeptide 
of this invention in a body sample is utilized as a 
means to monitor the fate of therapeutically 



administered ct 6B or * 3B derived polypeptides according 
to the therapeutic methods disclosed herein. 

Also contemplated are immunological assays 
capable of detecting the presence of immunoreaction 
product formation without the use of a label. Such 
methods employ a "detection means", which means are 
themselves well-known in clinical diagnostic chemistry 
and constitute a part of this invention only insofar 
as they are utilized with otherwise novel 
polypeptides , methods and systems. Exemplary 
detection means include methods known as biosensors 
and include biosensing methods based on detecting 
changes in the reflectivity of a surface, changes in 
the absorption of an evanescent wave by optical fibers 
or changes in the propagation of surface acoustical 
waves . 

Examples 

The following Examples illustrate, but do not 
limit, the present invention. 
1. Polypeptides 

Polypeptides were synthesized using the 
classical solid-phase technique described by 
Merrifield, Adv. Enzvmol. . 32:221-96 (1969) as adapted 
for use with a Model 43 OA automated peptide 
synthesizer (Applied Biosystems, Foster City, CA) . 
Polypeptide resins were cleaved by hydrogen fluoride, 
extracted and analyzed for purity by high-performance 
liquid chromatograph using a reverse-phase C18 column. 
(Waters Associates, Milf ord, MA) . 

The amino acid residue sequence of the 
polypeptides and their designations are as follows: 
pa 6A l IHAQPSDKERLTSDA 
pa 6B l DEK YI DNLEKKQ W ITKWNRNE SYS 

Polypeptide pa 6A has a sequence from the 
cytoplasmic domain of a 6A and is shown in SEQ ID NO 1 



from residue 1059 to residue 1073 to which an 
additional cysteine residue was included at the N- 
terminus for coupling the peptide to a protein carrier 
(KLH) for immunization < Polypeptide pa 6B has a 
sequence from the cytoplasmic domain of a 6B and is 
shown in SEQ ID NO 3 from residue 1068 to residue 1091 
to which an additional cysteine residue was included 
at the N-terminus for coupling the peptide to a 
protein carrier (KLH) for immunization. 
2 . Preparation of Polyclonal Antisera 
a. Conjugation of KLH 

Briefly, as a generalized procedure for each 
polypeptide, 4 milligrams of KLH in 0.25 milliliters 
(ml) of 10 millimolar (mM) sodium phosphate buffer (pH 
7.2) is reacted with 0.7 milligrams (mg) of MBS 
dissolved in DMF, and the resulting admixture is 
stirred for 3 0 minutes at room temperature. The MBS 
solution is added dropwise to ensure that the local 
concentration of DMF was not too high, as KLH is 
insoluble at DMF concentrations of about 30% or 
higher. The reaction product, KLH-MB, is passed 
through a chromatography column prepared with Sephadex 
G-25 (Pharmacia Fine Chemicals, Piscataway, NJ) 
equilibrated with 50 mM sodium phosphate buffer (pH 
6.0) to remove free MBS. KLH recovery from peak 
fractions of the column eluate, monitored at 280 
nanometers,, is typically approximately 8 0%. 

The KLH-MB so prepared is then reacted with 5 mg 
of polypeptide dissolved in 1 ml of buffer. The pH 
value of the resulting reaction composition is 
adjusted to 7-7.5, and the reaction composition is 
stirred at room temperature for 3 hours to provide a 
polypeptide-carrier conjugate. 

b. Immunization and Harvesting of Polyclonal 
Antisera 
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Inoculum stock solutions are prepared with CFA, 
IFA or alum as follows: An amount of the synthetic 
polypeptide-con jugate sufficient to provide the 
desired amount of polypeptide per inoculation is 
dissolved in phosphate-buffered saline (PBS) at a pH 
value of 7.2. Equal volumes of CFA, IFA or alum are 
then mixed with the polypeptide solution to provide an 
inoculum containing polypeptide, water and adjuvant in 
which the water-to-oil ratio is about 1:1. The 
mixture is thereafter homogenized to provide the 
inoculum stock solution. * 

Rabbits used herein to raise anti-polypeptide 
antibodies were injected subcutaneously with an 
inoculum comprising 200 micrograms (ug) of a 
polypeptide conjugate (polypeptide plus carrier) 
emulsified in complete Freund's adjuvant (CFA) ; 200 ug 
of polypeptide conjugate, incomplete in Freund f s 
adjuvant (IFA); and 200 ug of polypeptide conjugate 
with 4 mg alum injected intraperitoneally on days 0, 
14 and 21 , respectively, of the immunization schedule. 
Each inoculation (immunization) consisted of four 
injections of the inoculum. Mice may be immunized in 
a similar way using about one tenth of the above dose 
per injection. 

Animals are typically bled 4 and 15 weeks after 
the first injection. Control re-immune serum was 
obtained from each animal by bleeding just. before the 
initial immunization. 

Control inoculum stock solutions can also be 
prepared with keyhole limpet hemocyanin (KLH) , KLH in 
IFA (incomplete Freund's adjuvant), KLH-alum absorbed, 
KLH-alum absorbed-pertussis, edestin, thyreoglobulin, 
tetanus toxoid, tetanus toxoid in IFA, cholera toxoid 
and cholera toxoid in IFA. 

Upon injection or other introduction of the 
antigen or inoculum into the host animal, the immune 



system of the animal responds by producing large 
amounts of antibody to the antigen. Since the 
specific antigenic determinant of the manufactured 
antigen; i.e., the antigen formed form the synthetic 
polypeptide linked to the carrier corresponds to the 
determinant of the natural antigen of interest, the 
host animal manufactures antibodies not only to the 
synthetic polypeptide to which the synthetic 
polypeptide antigen corresponds; i.e., to the a 6B 
protein. 

c. Immunoreactivitv of Anti-peptide Antisey a 
With Native a f Proteins 
1. Protocols and Reagents 

The rabbit polyclonal anti-a 6 
cytoplasmic domain antiserum designated 6844 was 
raised against the last 15 amino acids 
( IHAQP S DKERLTS DA ) (SEQ ID NO 1, residue 1059 to 
residue 1073) of the reported human a 6 (a 6A ) sequence 
(Tamura et al., J. Cell. Biol. . 111:1593-1604, 1990), 
to which an additional cystine residue was included at 
the N-terminus for coupling the peptide to a protein 
carrier (KLH) for immunization. 

The rat monoclonal antibody, GoH3, is specific 
for an extracellular epitope on both the human and 
murine a 6 subunits (Sonnenberg et al., J. Biol . Chem. . 
262:10376-83, 1987). The is otype-matched control 
antibody, B3B4, recognizes the B lymphocyte specific 
antigen, CD23. 

The anti^a 6 specific monoclonal antibody, 
135.13c, and the isotype matched control antibody, 
439.9b, specific for the human £ 4 integrin subunit, 
have been previously described (Kennel et al., J. 
Biol. Chem. , 264:15515-21, 1989). 

Anti-peptide antiserar to the cytoplasmic domains 
of rat a 19 chicken a 3 , human a 4 , human a 5 and human B x 
sequences were shown to be cross-reactive with the 
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respective mouse B ± integrins by immunoprecipitation 
of B16F1 melanoma, STO fibroblast and MMT carcinoma 
murine cell lines. 

Antisera to the cytoplasmic domain of human a 6B 
were prepared by immunizations of rabbits with the 
peptide pct 6B l having the sequence 

DEKYIDNLEKKQWITKWNRNESYS (SEQ ID NO 3, residue 1068 to 
residue 1091) as described above to which an 
additional cysteine residue was included at the N- 
terrainus for coupling the peptide to a protein carrier 
(KLH) for immunization* This antisera is designated 
382. 

The ES cells and B16F1 cell line were used in 
these immunoreaction studies. The ES cell line, CCE 
(Schwartzberg et al. , Science . 246:799-803, 1989) was 
initially cultured on murine embryonic fibroblasts 
(STO cells) to prevent differentiation. However, in 
order to study the expression and function of 
integrins in this ES cell line it was necessary to 
remove the STO cells from the culture system. 
Therefore, the CCE ES cell line was subcloned into LIF 
(10 3 units /ml) (Amrad Co. Australia) containing media 
(DMEM; 10% FCS, lOOmM B-mercaptoethanol, 2mM 
glutaroine) . LIF has been shown to prevent ES cell 
differentiation (Moreau et al., Nature, 336:690-92, 
1988; Smith et al., Nature, 336:688-90, 1988; Williams 
et al., Nature . 336:684-687, 1988). The sublines were 
cultured on gelatin (0.1%) coated plates. Several 
subclones were expanded and continually cultured in 
LIF containing media. The subline ESI was chosen for 
the studies described here. ESI cells were allowed to 
differentiate on gelatin (0.1%) coated plates over a 
period of 8-9 days in the absence of LIF. 

The murine B16F1 melanoma line, obtained from Dr. 
Ralph Reisfeld (Scripps Clinic, La Jolla, California), 
was derived from a C57B1/6 melanoma and cultured in 



51 



DMEM, 5% FCS, 2mM glutamine and penicillin- 
streptomycin (50 IU/ml- 50ug/ml) . ^ 

Undifferentiated ES cells (1-2 x 10 7 cells) were 
surface labeled with Na 125 I using the lactoperoxidase 
procedure (Roth, Methods Enzvmol. , 37 (B) : 223-33 , 
1975) . Differentiated ES cells proved to be 
significantly more fragile than undifferentiated ES 
cells and did not survive the more rigorous washing 
steps required during the iodination procedure. 
Therefore, differentiated ES cells were metabolically 
labeled with [ 35 S] methionine as described previously 
by Kajiji et al, EMBO J. , 8:673-680 (1989). 
Preparation of non-ionic detergent cell lysates, 
immunoprecipitations and analysis by SDS-PAGE were 
performed as described by Kajiji et al (1989), supra , 

Immunoprecipitation is conducted generally by 
first admixing the rabbit polyclonal antisera produced 
above with a cell lysate and maintaining the admixture 
for a time period sufficient for immunocomplexes to 
form. Thereafter, the immunoabsorbent Pansorbin 
(Sigma Chemical Co., St. Louis, MO) is added to the 
admixture containing the immunocomplexes and 
maintained to allow the Pansorbin to complex with 
(bind) the immunocomplex. Thereafter the Pansorbin- 
containing bound immunocomplexes are removed from the 
lysate admixture by centrif ugation, washed several 
times and the washed immunocomplexes are released from 
the Pansorbin and analyzed by SDS-PAGE. 

Sequential immunoprecipitation was also performed 
to identify the presence of multiple immunoreactive 
species in a single lysate. After a first 
immunoprecipitation as above the lysate is retained 
and subjected to a second immunoprecipitation with 
unbound Pansorbin. The resulting lysate from the 
second immunoprecipitation is again retained . and 
subjected to a third immunoprecipitation with unbound 
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Pansorbin. Thus by the successive rounds of 
sequential immunoprecipitation of a lysate using the 
same antibody species, that lysate becomes depleted of 
antigen immunoreactive with that antibody species. 
5 Thereafter, the depleted lysate is divided into 

aliquots and each aliquot is separately 
immunoprecipitated (re-IP or re-immunoprecipitated) 
using different antibodies. Antigens in the depleted 
lysate that immunoprecipitate with the second antibody 

10 different from the depleting first antibody are not 

immunoreactive with the first antibody. By sequential 
immunoprecipitation, two non-cross reacting antigen 
species can be identified. As described herein, the 
cytoplasmic domains of a 6A and a 6B are not cross 

15 reactive . 

Mouse etc 

Separate immunoprecipitations were carried 
out on undifferentiated murine ESI cells with 
antiserum 382 raised against a synthetic peptide 

20 corresponding to the sequence of the cytoplasmic tail 

of human a 6B , with control preimmune serum from the 
same rabbit, and with antisera 6844 directed to the 
cytoplasmic tail of human 6A. Only antisera 382 
precipitated protein bands virtually identical to 

25 those reactive with anti-a 6 monoclonal GoH3 which is 

specific for both a 6A and a 6B . These data indicated 
that ESI cells do express a 6B protein, probably 
complexed with B 1# and antisera 3 82 is capable of 
recognizing the a 6B protein. 

3 0 In contrast to the immunoprecipitation data from 

undifferentiated ESI cells, the anti-a 6A cytoplasmic 
domain polyclonal antiserum, 6844, could 
immunoprecipitate the a 6A isoforra from 35 S-methionine 
labelled lysates obtained from differentiated ESI 

35 cells. Thus, differentiation of ESI cells is 
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accompanied by the induction of expression of the a 6A 
isoform. 

The absence of the a 6A isoform in differentiated 
ESI cells can be seen by immuhoprecipitations using 
5 GoH3 or 6844, which is shown in Figure 1. Whereas the 

GoH3 antibody detects the lower molecular weight 
species corresponding to a 6 , the 6844 antibody, 
immunospecif ic for a 6A , does not detect any a 6 species, 
indicating that the GoH3 -reactive form is an isoform, 
10 namely a 6B . 

Similar immunoprecipitation assays were carried 
out on the D3 ES cell line (see Figure l) . The D3 
embryonic stem cell line was derived by Doetschman et 
a l-# J . Embrvol . Exp . Moroh . . 87:27-45, (1985). D3 
15 cells were cultured in LIF containing medium as 

described above except that 15% FCS was used. 
Immunoprecipitations of [ 35 S]methionine-labelled 
lysates showed that the a 6B isoform is expressed at 
the protein level in* both undifferentiated and 
20 differentiated D3 cells while the <* 6A isoform was 

found only in the differentiated cells. This would 
suggest that the ability to switch on a 6A expression 
upon differentiation may be a general property of ES 
cells. 

25 Because the 3 82 antisera was raised to a human 

a 6B cytoplasmic domain-derived polypeptide and yet is 
shown above to immunoreact with the mouse a 6B protein , 
the above data ailso shows that an anti-a 6B antibody, 
whether raised to human or mouse varieties of ct 6B can 

3 0 be used to immunoreact with both human or mouse a 6B . 

Human a g 

Antisera 382 to a synthetic peptide 
corresponding to the last 2 5 residues of human a« 
immunoprecipitated from radiolabeled detergent lysates 
3 5 of the human choriocarcinoma cell line JAR (see 

Example 4 for description of JAR cells) a pattern of 
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bands similar or identical to those obtained with 
6844, an anti-peptide antiserum to the a 6A cytoplasmic 
domain, and GoH3 , a monoclonal antibody to the 
extracellular domain of a 6 (see Figure 2) . The bands 
5 corresponded in molecular weight to a 6 , B x and B*, and 

were positively identified as such with specific 
antibodies. This result is compatible with JAR cells 
expressing both cr 6 B x and cc 6 & A heterodimers , and with 
PCR amplifications detecting both a 6X and a 6B isoform 
10 bands in JAR cells (see Example 3) . 

Sequential immunoprecipitations (Figure 3) showed 
that antibody GoH3 completely depleted the JAR lysates 
of antigen reactive with antisera 3 82 (anti-a 6B ) or 
6844 (anti-a 6A ) . The 382 antiserum did not remove any 
15 material reactive with 6844, and 6844 did not remove 

any 382-reactive material, while both antisera 
reduced, but did not completely remove GoH3 reactivity 
(Figure 2) . These results indicate that JAR cells 
express both a 6A and cr 6B proteins, each of which is 
20 paired with either B 1 or B 4 . These results also 

indicate that antisera raised to a mouse protein, 
namely the cytoplasmic domain of mouse oc 6B , 
immunoreacts with its human counterpart protein, human 

a 6B- 

25 

3 • Identification and Cloning of g g p and g 3H 
cDNAs 

cDNA molecules encoding human and mouise a 6B and 
mouse cc 3B cytoplasmic regions were prepared and 

3 0 fragments of the cDNA molecules were selectively 

amplified using the polymerase chain reaction (PCR) in 
the presence of specific oligonucleotide primers in 
order to characterize gene expression of the a 6B and 
a 3B proteins. 

35 a. Procotols 
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Poly-A+ mRNA was isolated from human JAR 
cells (American Tissue Type Collection, ATCC, 
Bethesda, MD, ATCC HTB 144), human U937 cells (ATCC 
CRL 1591), human FG cells (Dr. P. Meitner, Brown 
University) and both differentiated and 
undifferentiated cell lines using the Invitrogen 
Fastrack Kit (Invitrogen, La *Tolla, Ca.). Single 
stranded cDNA was synthesized from 10 ug of mRNA using 
AMV reverse transcriptase (20U; Molecular Genetics 
Resources, Tampa, Fl.) and one ug of random hexamer 
primers (Pharmacia) . The cDNAs were extracted with 
phenol/chloroform, then ethanol precipitated and about 
0.5 to 10 ug cDNA was resuspended in 50-70ul of 
sterile water. 

One ul of the resuspended cDNA was amplified per 
50 ul PCR reaction mixture (2.5mM MgC12, 50mM KC1, 
lOmM 13-mercaptoethanol, 66mM Tris.HCl; pH8.3) using 
0.1 uM oligonucleotide primers, 0.25mM each of dATP, 
dTTP, dCTP, and dGTP^ and 1.25U of TAQ 1 polymerase 
(AmpliTaq; Perkin Elmer/Cetus, Ca.). The PCR program 
consisted of 2 steps: (a) 40 cycles of 1 min at 94°C, 
2 min at 55°C, and 3 min at 72°C with a 5 sec/cycle 
extension on the 72°C segment, (b) 10 min at 72°C and 
a final shift to 4°C. Second round PCR was carried out 
on one ul of the reaction mixture generated from the 
first round PCR. 

Nested pairs of PCR primers were employed to 
ensure that a 6 specific fragments were amplified. Both 
sets of a 6 primers were derived from the human a 6A cDNA 
sequence as determined by Tamura et al. . J. Cell. 
Biol. , Ill; 1593-1604 . (1990). The first set 
corresponded to bp 2918-2937 (primer 1157) and 3454- 
3473 (primer 1156) of the human a 6A sequence while the 
nested primer pair corresponded to bp 2942-2960 
(primer 1681) and 3433-3452 (primer 2002) . The 
sequence of these four primers are shown in SEQ ID NOS 



11-14, respectively. a 3 PCR primers designated primer 
2032 and 2 03 3 were derived from the hamster cDNA 
sequence as determined by Tsuji et al., J . Biol, 
Chem. , 265:7016-7021 (1990). The sequence of primers 
203 2 and 2 03 3 are shown in SEQ ID NOS 15 and 16, 
respectively. 

Oligonucleotide primers were chemically 
synthesized by using a "Gene Assembler" automated 
synthesizer (Pharmacia, Piscataway, NJ) . 

Amplified fragments- from first round PCR were 
purified using Gene Clean (Bio 101, La Jolla, Ca) , 
treated with DNA polymerase I and T4 polynucleotide 
kinase, again purified with Gene Clean, and the blunt- 
ended fragments were subcloned into the EcoRV site of 
Bluescript-pKS+ (Stratagene, La Jolla, Ca) . Clones 
containing insert were sequenced manually (Sequenase 
kit; USB, Cleveland, Oh) according to the 
manufacturer's instructions using T3 and T7 polymerase 
vector primer sequences. Sequences were analyzed on a 
VAX-VMS, version 5.2 computer, with programs of the 
University of Wisconsin Genetics Computer Group 
(Devereux et al, Nucl. Acids Res 12 ; 387-395 , 1984). 

b. Results 

i. Human a fi 

Oligonucleotides 1156 and 1157 flanking 
the 3 1 end of the coding region of integrin a 6 mRNA 
were used as primers in polymerase chain reactions 
(PCR) . Two products, of 540 bp and 410 bp, were 
obtained using first strand cDNAs from various cell 
lines as templates (Figure 4) . These same products 
were obtained in second-round PCR with a nested set of 
primers, indicating their specificity. 

Both the 540bp and the 410bp PCR 
products were subcloned and sequenced. The nucleotide 
sequence of the 54 0 bp fragment (designated a 6x in 
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Figure 5) matches exactly the sequence of ar 6 mRNA 
recently reported by Tamura et al., J . Cell , Biol. . 
Ill: 1593-1604 , 1990, and encodes the 3 1 portion of the 
end of the extracellular domain, the transmembrane and 
5 the cytoplasmic domains, followed by the initial part 

of the 3 1 untranslated region (3 • UT) . 

The sequence of the 410 bp band matches 
the 54 0 bp sequence, with the exception of a 13 0 bp 
gap shown in the lower sequence of Figure 5, which 
10 lower sequence corresponds to the nucleotide sequence 

of a 6B . This gap corresponds to the region encoding 
the predicted a 6A cytoplasmic domain, from the 
boundary with the transmembrane domain to 25 bp past 
the stop codon. Without this 130 bp segment, however, 
15 the open reading frame continues in the previous 3 • 

UT, resulting in an a 6B protein with an alternative 
cytoplasmic domain (Figure 6) . This alternative domain 
is 17 amino acid longer than, and bears no sequence 
homology with, the reported a6A cytoplasmic domain, 
2 0 but it does contain the sequence GFFKR, a motif 

present at the upstream border of all mammalian 
integrin a chains sequenced. For convenience, the 
published a 6 sequence is referred to as a 6A , and the a 6 
having the isoform cytoplasmic domain identified 
25 herein is referred to as a 6B . 

ii. Mouse a fi 

Amplification of mouse a 6 cDNA 
expressed by undifferentiated ESI and B16F1 cells was 
performed on first strand cDNA derived from these 
30 mouse cells using the polymerase chain reaction (PGR) . 

The nested sets of PCR primers, pairs 1157/1156 and 
1681/2002 described above, were employed. Figure 7A 
shows the PCR products obtained. 

The PCR fragment amplified from B16F1 ( 11 Bl 6 11 ) 
35 cDNA correspond to the size expected (510 bp) for the 

murine homologue of the human a 6 (Figure 7A; lane 2) . 
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However, the PCR fragment obtained from the 
amplification of the ESI cell cDNA was significantly 
smaller. Amplification of cDNAs derived from four 
independent ESI mRNA preparations yielded only the 
smaller fragment and never the larger fragment 
amplified from B16F1 cDNA. 

The PCR fragments from the ESI and B16F1 cells 
were subcloned into the Bluescript-pKS+ vector and 
sequenced. Figure 8 shows the nucleotide sequences of 
the two PCR fragments. The sequence of the larger 
B16F1 fragment was shown to he 89% identical to the 
human a 6A sequence at the nucleotide level and 91% 
identical at the amino acid level, Tamura et al., J. 
Cell. Biol. . 111:1593-1604 (1990). Thus the larger 
fragments sequence represents the murine homologue of 
the human a6A subunit. The B16F1 PCR fragment (Figure 
8) encodes the C-terminal portion of the extracellular 
domain as well as the transmembrane and cytoplasmic 
domains of the cr 6 subunit. Due to the selection of 
primers, additional coding sequences 3' to the 
terminus of the sequence shown in Figure 8 were not 
detected. Thus, additional amino acid residues not 
shown in Figure 8 are present in the native mouse or 6B 
protein. 

The sequence of the smaller PCR fragment (Figure 
8) was identical to the B16F1 sequence except that an 
internal deletion of 13 0 bp was observed. The 
location of the 13 0 bp deletion observed in the ESI a 6 
PCR fragment exactly matched that of the human ct^ 
sequence. Therefore, ESI cells expressed the murine 
equivalent of the a 6B isoform. 
iii. Mouse a 3 

Expression of the a 3 isoforms was also 
investigated in various mouse tissues including 
muscle, heart, brain, lung and ovary. Using the PCR 
procedure described above with the hamster a 3 primers, 
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a larger band corresponding to a 3A was amplified from 
most tissues except heart, kidney, liver, thymus and 
spleen (Table 2;Example 4). A smaller band 
corresponding to or 3B was detected in heart and brain. 
Cloning and sequencing of these bands showed that the 
larger band corresponds exactly to the reported cr 3 
sequence (a 3A ) , while the smaller band lacks a 144 bp 
segment and, like a 6B , encodes an a 3 with an 
alternative cytoplasmic tail (ct 3B ) . The amino acid 
residue and nucleotide sequences of the mouse a 3B 
cDNA-derived PCR fragments are shown in SEQ ID NQS 9 
and 10, respectively. 

4 . Tissue distribution of tt 6 A , or G B , a 3 A and g jB 
a. PCR Amplification 

The distribution of the a 6 isoforms in 
cultured cell lines and mouse tissue was assessed by 
PCR as described in Example 3. The majority of the 
cells tested contained both a 6A and a 6B mRNA (see 
Tables 1 and 2). However, the two isoforms were 
reproducibly found at ratios characteristic of a cell 
line. Interestingly, two carcinoma cell lines and 
three lines of mouse embryonic fibroblasts 
(immortalized, non-tranf ormed) contained exclusively 
a 6At w hile embryonic stem cells and F9 teratocarcinoma 
cells contained exclusively a 6B (Table 1) . 

TABLE 1 



CELL LINE 


CELL TYPE 


«6A 




*6B 


«3A 


FG 


Pancreatic Carcinoma 




> 




+ 


1320 Met 


ti 


+ 


> 




ND 


Panc-1 


ti 




> 


+ 


+ 


SGR 


it 


+ ■ 


> 


+ 


ND 


JAR 


Choriocarcinoma 


4- 


< 


+ 


+ 


JEG-3 


it 


+ 


< 




ND 
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10 



15 



BeWo 


ft 




< 


+ 


ND 


ND 


LoVo 


Colon Carcinoma 




< 


+ 


ND 


ND 


Colo 396 


it 




< 


+ 


+ 


- 


CaCo— 2 




+ 




+ 


+ 


- 








> 




+ 


'■ — 


HeLa 


Cervical Carcinoma 




> 




ND 


ND 


UCLA— P 3 


Liing Carcinoma 


+ 


> 




+ 


- 


A431 


Epidermoid Carcinoma 


+ 






ND 


ND 




Erythroleukemia 






+ 


ND 


ND 


U937 


Histiocytic Lymphoma 


+ 


> 


+ 


+ 




804G(Rat) 


Bladder Carcinoma 


+ 






+ 




3T3 (M) 


Embryonic Fibroblast 


+ 






+ 




F9 (M) 


Teratocarcinoma 






+ 


+ 




ES (M) 


Embryonic Stem 






+ 


+ 




ES (M) 


(Differentiated) 


+ 




+ 


ND 


ND 



Cells were analyzed by PCR amplification of ct 6 and a 3 
isoforms using the following human or mouse cells, with the 

20 cell sources indicated in parenthesis: pancreatic carcinoma: 

FG, SGR and 1320 Met cells (Dr. P. Meitner, Brown 
University) ; Panc-1 cells (ATCC CRL 1469) ; choriocarcinoma: 
JAR cells (ATCC HTB 144); JEG-3 cells (ATCC HTB 3 6); BeWo 
cells (ATCC CCL 98) ; colon carcinoma: LoVo cells (ATCC CCL 

25 229 ) ; Colo 396 cells (Dr. T. Edgington, Scripps Clinic, La 

Jolla, CA); CaCo-2 cells (ATCC HTB 37); HT-29 cells (ATCC 
HTB 38); Hela cervical carcinoma cells (ATCC CCL 2); UCLA- 
PS lung carcinoma cells (L. Walker, Scripps) ; A431 
epidermoid carcinoma cells (ATCC CRL 1555) ; K562 

30 erythroleukemia cells (ATCC CCL 243) ; U937 histiocytic 

lymphoma cells (ATCC CCL 1593); 8049 rat bladder carcinoma 
cells (J. Jones, Northwestern University, Evanston, IL) ; 
NIH/3T3 mouse embryo fibroblasts (ATCC CRL 1658); F9 mouse 
teratocarcinoma cells ATCC CRL 1720) ; ES mouse embryonic 

35 stem cells (E. Robertson, Columbia University, NY) . 
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Table 1 illustrates the distinction of a 6A and a 6B , and 
or 3A and a 3B subunit-encoding mRNAs in human and mouse 
cultured cell lines. PCR amplification was performed on 
single-stranded cDNA generated from each cell type, using 
5 oligonucleotides specific for the a 6 or a 3 subunit, 

respectively. The (+) symbol represents the presence of 
subunit-specif ic amplification product in the tested sample, 
the (-) symbol represents its absence, and (ND) indicates 
that analysis was not conducted on that tissue type. The 

10 (>) is used when the a 6A subunit mRNA predominates over the 

a 6B subunit mRNA, and the (<) symbol is used when the a 6B 
subunit mRNA is the predominant species in the tissue. 

By the same PCR assay, normal mouse lung, liver, 
spleen and cervix tissues were solely a 6A , brain, 

15 ovary and kidney were solely a 6B , while all other 

tissues tested contained both a 6 isoforms (Table 2) . 

TABLE 2 



TISSUE 








ct 3A 




Muscle 




> 








Heart 


+ 


> 


+ 




+ 


Kidney 






+ 






Liver 












Brain 






+ 


+ 


< + 


Lung 


+ 






+ 




Stomach 


+ 


< 


+ 


ND 


ND 


Intestine 


+ 


< 




ND 


ND 


Cervix 








ND 


ND 


Submax 






+ 


ND 


ND 


Ovary 






+ 


+ 




Thymus 


+ 


> 








Spleen 


+ . 











35 
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Table 2 illustrates the distribution of a 6A and B 
and a 3A and B subunit-encoding mRNAs in mouse tissues. 
The symbols in Table 2 are the same as in Table 1. 

Primary and nested PCR reactions were carried out 
5 on differentiated cell lines as described in Example 

3. ESI cells were allowed to differentiate over a 
period of 8-9 days in the absence of Leukemia 
Inhibitory Factor (LIF) . The morphology of the 
differentiated cells was dramatically different from 

10 that of undifferentiated ESI cells maintained in LIF. 

PCR amplification on cDNA from undifferentiated ESI 
cells, using a 6 specific primers, produced the 380 bp 
fragment corresponding to the a 6B cytoplasmic sequence 
(Figure 7B, lane 1) . However, similar amplification of 

15 cDNA from the differentiated cells produced two 

distinct fragments of 510 bp and 380 bp (Figure 7B, 
lane 2), shown by nucleotide sequencing to be the a 6A 
and a SB isoforms, respectively. 

20 b. In Situ Immunostainina of Tiss ues to Detect 

Tissue Distribution 

Kidney biopsy materials were obtained 
by percutaneous needle biopsies using modified Vim- 
Silverman needles in patients with glomerulonephritis. 

25 A small portion of kidney biopsy materials were fixed 

with 4% paraformaldehyde for 4 hours at 4°C and 
embedded in paraffin using an automatic processor 
(Tissue-Tek* Rotary Tissue Processor). The tissue was 
cut in 4 micron thickness with an AO rotary microtome, 

30 and deparaf inized with xylene or Histoclear (Baxter) 

and rehydrated with graded alcohol. The rehydrated 
sections were washed with 0.1 M glycine in TBS (0.005 
M Tris-HCl; 0.9% NaCl, pH 7.5) for 5 minutes, treated 
with 0.1% Triton X-100 for 2 min at room temperature 

35 (RT) , and trypsinized (0.1% trypsin for 5 min at 

37 °C) . Nonspecific binding sites were saturated by a 
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blocking solution (5% dry milk solids, l% heat 
inactivated horse serum in TBS) for 30 minutes. 
Serially diluted primary antibodies [1 ug/ul of 33% 
saturated ammonium sulfate (SAS) cut of antisera 6844 
and 382; 1:10 to 1:1000 dilutions in reagent diluent: 
i.e. 2.5% bovine serum albumin in TBS] and normal 
control (SAS cut of normal rabbit serum, lug/ul, 1:10 
to 1:1000 dilution in reagent diluent) were incubated 
on sections in humidified chambers at room temperature 
for 1 hr. Tissue sections were further incubated with 
peroxidase conjugated goat anti-rabbit antibody 
(Jackson Immunology) in reagent diluent (1:200) for 30 
minutes. After this, 0.02% AEC (3-Amino-a- 
ethylcarbozole, Aldrich) was applied for 3 0 minutes at 
room temperature. Each step was followed by 3 -minute 
washes in 0.005 M Tris-HCl, 0.9% NaCl pH 7.5 (TBS 
wash) . Washed tissue sections were counterstained 
with Mayor's hematoxylin for 3 0 sec, mounted in Gel 
Mount (Biomeda) and* observed under a light 
microscope. 

The results of the in situ immunostained kidney 
sections using the anti-peptide antisera specific for 
a 6A (6844) or specific for a 6B (382) are shown in 
Figure 9. Panel A shows anti-a 6A antibody molecules 
staining podocytes in the glomerular structure of the 
kidney but no staining in the tubules of the kidney. 
Panel B shows anti-a 6B antibody molecules staining the 
epithelial cells of the distal or collecting tubules 
of the kidney but not the glomerular cells. 

Additional kidney samples were similarly analyzed 
and the results are shown in Table 3 . 
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TABLE 3 1 



10 



35 



Patient Sera 6488 Sera 382. 

G T G T 

Normal - _ ~ ~ 

1 ' + + 

2 ++ ± ++ 

3 +++ - - +++ 

4 +++ ++++ 

5 '++++ - - ++++ 



1 »G n indicates the immunostaining pattern observed in 
15 glomerular epithelial cells of the kidney, whereas »T" 

indicates the immunostaining pattern observed in the 
tubular epithelial cells, where (-) indicates no 
staining and + to ++++ indicates increasing 
intensities of stain. Patients 1-5 are patients that 
20 all have glomerular nephritis clinically indicated as 

to require kidney biopsy. 

The results in Table 3 indicate that in all 
patients exhibiting symptoms of kidney dysfunction a 
distinct staining pattern is observed, namely that 
25 antisera immunospecif ic for a 6X cytoplasmic domain 

(6488) reacts with glomerular cells and antisera 
iramunospecific for 2246B cytoplasmic domain (382) 
reacts with the tubular epithelial cells. 

These results show that antibodies immunbreactive 
30 with the cytoplasmic domain of a 6B are useful for 

distinguishing cell types in kidney sections, and 
particularly to identify distal and collecting tubular 
epithelial cells in patients having conditions of 
kidney dysfunction such as glomerular nephritis. 



Although the present invention has now been 
described in terms of certain preferred embodiments, 
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and exemplified with respect thereto, one skilled in 
the art will readily appreciate that various 
modifications, changes, omissions and substitutions 
may be made without departing from the spirit thereof. 
5 it is intended, therefore, that the present invention 

be limited solely by the scope of the following 
claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Tamura, Richard N. 

Quaranta , Vito 

rii) TITLE OF INVENTION: INTEGRIN ALPHA SUBUNIT CYTOPLASMIC 
DOMAIN POLYPEPTIDES. ANTIBODIES AND METHODS 

(iii) NUMBER OF SEQUENCES: 16 

Civ) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Office of Patent Counsel, TSRI | 

(B) STREET: 106 66 North Torrey Pines Road, Mail Drop TPC8 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY : United States 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

CD) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

CA) APPLICATION NUMBER: PCT/US92/ 

CB) FILING DATE: 04-MAY-1992 
(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/695,564 

(B) FILING DATE: 03-MAY-1992 

Cviii) ATTORNEY/AGENT INFORMATION: 

CA) NAME: Fitting, Thomas 

CB) REGISTRATION NUMBER: 34,163 

CC) REFERENCE/DOCKET NUMBER: BEC0010P 

Cix) TELECOMMUNICATION INFORMATION: 
CA) TELEPHONE: 619-554-2937 



C2) INFORMATION FOR SEQ ID N0:1: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 1073 amino acids 

CB) TYPE: amino acid 
CD) TOPOLOGY: unknown 
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MOLECULE TYPE: protein 



(ili) HYPOTHETICAL: NO 

(iy) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: Domain 

(B) LOCATION: 1012.. 1037 

(D) OTHER INFORMATION: /label- TRANSMEMBRANE 



/note= "The putative transmembrane region is 
encompassed by amino acids 1012-1037." 



(ix) FEATURE: 

(A) NAME/KEY: Cleavage-site 

(B) LOCATION: (23*24) 

(D) OTHER INFORMATION: /note- "The mature protein is 



cleaved from the signal sequence between amino 
acids 23-24." 



(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 223 

(D) OTHER INFORMATION : /label- GLYCOSYLATION 



/note« "Potential site of N-linked glycosylation." 



/note» "Potential site of N-linked glycosylation." 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- s ite 

(B) LOCATION: 370 

(D) OTHER INFORMATION: /labels GLYCOSYLATION 



/note- "Potential site of N-linked glycosylation." 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 513 

(D) OTHER INFORMATION: /label= GLYCOSYLATION 



/note« "Potential site of N-linked glycosylation." 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 731 

(D) OTHER INFORMATION: /label** GLYCOSYLATION 



/note- "Potential site of N-linked glycosylation." 



(ix) 



FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 284 

(D) OTHER INFORMATION: /label* 



GLYCOSYLATION 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 748 

CDS OTHER INFORMATION: /label- GLYCOSYLATION 

/note- "Potential site of N-linked glycosylation. « 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 891 

CD) OTHER INFORMATION: /label- GLYCOSYLATION 

/note* "Potential site of N-linked glycosylate. « 

(ix) FEATURE: 

(A) NAME/KEY: Modified -site 

(B) LOCATION: 927 

(D) OTHER INFORMATION: /label- GLYCOSYLATION 

/note- "Potential site of N-linked glycosylate. 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 958 

(D) OTHER INFORMATION : /label- GLYCOSYLATION 

/note- "Potential site of N-linked glycosylation. 

(ix) FEATURE: 

(A) NAME/KEY: Binding-site 

(B) LOCATION: 230.. 238 

(D) OTHER INFORMATION: /note- "Represents a putative 
cation binding domain." 

(ix) FEATURE: 

(A) NAME/KEY: Binding- site 

(B) LOCATION: 324.. 332 

(D) OTHER INFORMATION: /note- "Represents a putative 
cation binding domain." 

(ix) FEATURE: 

(A) NAME/KEY: Binding- site 

(B) LOCATION: 386.-394 \ • 
(D) OTHER INFORMATION: /note- "Represents a putative 

cation binding domain." 

(ix) FEATURE: 

(A) NAME/KEY: Binding -site 

(B) LOCATION: 441.. 449 

(D) OTHER INFORMATION: /note- "Represents a putative 
cation binding domain." 

(ix) FEATURE: 

(A) NAME/KEY : Domain 

(B) LOCATION: 1040.. 1044 
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(D) OTHER INFORMATION: /label- CYTOPLASMIC 

/note= "The cytoplasmic sequence, which is 
conserved in virtually all of the integrin ALPHA 
chains . " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 927. .1073 

(D) OTHER INFORMATION: /note= "The sequence encoded by the 
fragment of ALPHA 6A cDNA amplified using primers 
1156/1157." 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Ala Ala Ala Gly Gin Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
1 5 10 15 

Leu Ser Arg Leu Gly Ala Ala Phe Asn Leu Asp Thr Arg Glu Asp Asn 
20 25 30 

Val lie Arg Lys Tyr Gly Asp Pro Gly Ser Leu Phe Gly Phe Ser Leu 
35 40 45 

Ala Met His Trp Gin Leu Gin Pro Glu Asp Lys Arg Leu Leu Leu Val 
50 55 60 

Gly Ala Pro Arg Gly Glu Ala Leu Pro Leu Gin Arg Ala Phe Arg Thr 
65 70 75 80 

Gly Gly Leu Tyr Ser Cys Asp He Thr Ala Arg Gly Pro Cys Thr Arg 
85 90 95 

He Glu Phe Asp Asn Asp Ala Asp Pro Thr Ser Glu Ser Lys Glu Asp 
100 105 110 

Gin Trp Met Gly Val Thr Val Gin Ser Gin Gly Pro Gly Gly Lys Val 
115 120 125 

Val Thr Cys Ala His Arg Tyr Glu Lys Arg Gin His Val Asn Thr Lys 
130 135 140 

Gin Glu Ser Arg Asp He Phe Gly Arg Cys Tyr Val Leu Ser Gin Asn 
145 150 155 160 

Leu Arg lie Glu Asp Asp Met Asp Gly Gly Asp Trp Ser Phe Cys Asp 
165 170 175 

Gly Arg Leu Arg Gly His Glu Lys Phe Gly Ser Cys Gin Gin Gly Val 
180 185 190 
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Ala Ala Thr Phe Thr Lys Asp Phe His Tyr He Val Phe Gly Ala Pro 
Lg5 200 205 

Gly Thr Tyr Asn Trp Lys Gly lie Val Arg Val Glu Gin Lys Asn Asn 

210 215 2 

Thr Phe Phe Asp Met Asn He Phe Glu Asp Gly Pro Tyr Glu Val Gly 
225 230 235 

Glv Glu Thr Glu His Asp Glu Ser Leu Val Pro Val Pro Ala Asn Ser 
' 245 2 50 255 

Tvr Leu Gly Phe Ser Leu Asp Ser Gly Lys Gly lie Val Ser Lys Asp 
3 260 265 270 

Glu He Thr Phe Val Ser Gly Ala Pro Arg Ala Asn His Ser Gly Ala 
275 280 285 

Val Val Leu Leu Lys Arg Asp Met Lys Ser Ala His Leu Leu Pro Glu 
290 295 300 

His He Phe Asp Gly Glu Gly Leu Ala Ser Ser Phe Gly Tyr Asp Val 
305 310 315 320 

Ala Val Het Asp Leu Asn Lys Asp Gly Trp Gin Asp He Val He Gly 
325 • 330 335 

Ala Pro Gin Tyr Phe Asp Arg Asp Gly Glu Val Gly Gly Ala Val Tyr 
340 345 350 

Val Tyr Met Asn Gin Gin Gly Arg Trp Asn Asn Val Lys Pro He Arg 
355 360 365 

Leu Asn Gly Thr Lys Asp Ser Met Phe Gly He Ala Val Lys Asn He 
370 * " 375 380 

Gly Asp He Asn Gin Asp Gly Tyr Pro Asp He Ala Val Gly Ala Pro 
3B 5 390 395 400 

Tyr Asp Asp Leu Gly Lys Val Phe He Tyr His Gly Ser Ala Asn Gly 
405 410 415 

He Asn Thr Lys Pro Thr Gin Val Leu Lys Gly He Ser Pro Tyr Phe 
420 425 430 

Gly Tyr Ser He Ala Gly Asn Met Asp Leu Asp Arg Asn Ser Tyr Pro 
435 440 445 

Asp Val Ala Val Gly Ser Leu Ser Asp Ser Val Thr He Phe Arg Ser 
450 455 460 
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Are Pro Val He Asn He Gin Lys Thr He Thr Val Thr Pro Asn Arg 
46 | 470 475 480 

He Asp Leu Arg Gin Lys Thr Ala Cys Cly Ala Pro Ser Gly lie Cys 
485 490 495 

Leu Gin Val Lys Ser Cys Phe Glu Tyr Thr Ala Asn Pro Ala Gly Tyr 
500 . 505 510 

Asn Pro Ser He Ser He Val Gly Thr Leu Glu Ala Glu Lys Glu Arg 
515 520 525 

Arg Lys Ser Gly Leu Ser Ser Arg Val Gin Phe Arg Asn Gin Gly Ser 
530 535 540 

Glu Pro Lys Tyr Thr Gin Glu Leu Thr Leu Lys Arg Gin Lys Gin Lys 
545 " 550 555 560 

Val Cys Met Glu Glu Thr Leu Trp Leu Cln Asp Asn He Arg Asp Lys 
565 570 575 

Leu Arg Pro He Pro He Thr Ala Ser Val Glu He Gin Glu Pro Ser 
580 585 590 

Ser Arg Arg Arg Val Asn Ser Leu Pro Glu Val Leu Pro He Leu Asn 
595 " • 600 605 

Ser Asp Glu Pro Lys Thr Ala His He Asp Val His Phe Leu Lys Glu 
610 615 620 

Gly Cys Gly Asp Asp Asn Val Cys Asn Ser Asn Leu Lys Leu Glu Tyr 
625 630 v 635 640 

Lys Phe Cys Thr Arg Glu Gly Asn Gin Asp Lys Phe Ser Tyr Leu Pro 
645 650 655 

He Gin Lys Gly Val Pro Glu Leu Val Leu Lys Asp Gin Lys Asp He 
660 665 670 

Ala Leu Glu He Thr Val Thr Asn Ser Pro Ser Asn Pro Arg Asn Pro 
675 680 685 

Thr Lys Asp Gly Asp Asp Ala His Glu Ala Lys Leu He Ala Thr Phe 
690 695 700 

Pro Asp Thr Leu Thr Tyr Ser Ala Tyr Arg Glu Leu Arg Ala Phe Pro 
705 710 715 720 

Glu Lys Gin Leu Ser Cys Val Ala Asn Gin Asn Gly Ser Gin Ala Asp 
725 730 735 
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Cys Glu Leu Gly Asn Pro Phe Lys Arg Asn Ser Asn Val Thr Phe Tyr 

reu Val Leu Ser Thr Thr Glu Val Thr Phe Asp Thr Pro Tyr Leu Asp 
755 760 765 

He Asn Leu Lys Leu Glu Thr Thr Ser Asn Gin Asp Asn Leu Ala Pro 
770 775 780 

He Thr Ala Lys Ala Lys Val Val He Glu Leu Leu Leu Ser Val Ser 
785 " 790 795 800 

Gly Val Ala Lys Pro Ser Gin Val Tyr Phe Gly Gly Thr Val Val Gly 
3 805 810 815 

Glu Gin Ala Met Lys Ser Glu Asp Glu Val Gly Ser Leu He Glu Tyr 
820 825 830 

Glu Phe Arg Val He Asn Leu Gly Lys Pro Leu Thr Asn Leu Gly Thr 
835 840 845 

Ala Thr Leu Asn He Gin Trp Pro Lys Glu He Ser Asn Gly Lys Trp 
850 855 860 

Leu Leu Tyx Leu Val Lys Val Glu Ser Lys Gly Leu Glu Lys Val Thr 
865 870 • «75 880 

Cys Glu Pro Gin Lys Glu He Asn Ser Leu Asn Leu Thr Glu Ser His 
885 890 895 

Asn Ser Arg Lys Lys Arg Glu He Thr Glu Lys Gin He Asp Asp Asn 
900 905 ' 10 

Are Lys Phe Ser Leu Phe Ala Glu Arg Lys Tyr Gin Thr Leu Asn Cys 
S ' 915 920 925 

Ser Val Asn Val Asn Cys Val Asn He Arg Cys Pro Leu Arg Gly Leu 
930 935 940 

Asp Ser Lys Ala Ser Leu He Leu Arg Ser Arg Leu Trp Asn Ser Thr 
94 5 950 955 960 

Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp He Leu Met Arg 
965 970 975 

Ala Phe He Asp Val Thr Ala Ala Ala Glu Asn He Arg Leu Pro Asn 
980 985 990 

Ala Gly Thr Gin Val Arg Val Thr Val Phe Pro Ser Lys Thr Val Ala 
995 1000 3-005 
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Gin Tyr Ser Gly Val Pro Trp Trp lie lie Leu Val Ala lie Leu Ala 
1010 ' 1015 1020 

Glv lie Leu Met Leu Ala Leu Leu Val Phe He Leu Trp Lys Cys Gly 
10 25 1030 1035 1040 

Phe Phe Lys Arg Asn Lys Lys Asp His Tyr Asp Ala Thr Tyr His Lys 
1045 1050 1055 

Ala Glu He His Ala Gin Pro Ser Asp Lys Glu Arg Leu Thr Ser Asp 
1060 1065 1070 

Ala 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY : miscjeature 

(B) LOCATION: 1..5629 

<D) OTHER INFORMATION: /product- "Human ALPHA 6A" 

/note- "SEQ ID NO: 2 is the 5629 base nucleotide 
sequence of the human ALPHA 6A cDNA." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 147.. 149 

(D) OTHER INFORMATION: /function- "Transcription : 
initiator" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 216.. 3365 

(D) OTHER INFORMATION: /product^ "Human ALPHA 6A n 



(ix) FEATURE: 

(A) NAME/KEY: miscjeacure 
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CB) LOCATION: 3264.. 3278 

CD) OTHER INFORMATION: /product" "The cytoplasmic sequence 
GFFKR. " 

(ix) FEATURE: 

(A) NAME/KEY: misc_feacure 
CB) LOCATION: 3261.-3390 

CD) OTHER INFORMATION: /note- "The 130 nucleotide sequence 
present in SEQ ID NO: 2 but deleted from SEQ ID 
NO: 4." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
CB) LOCATION: 2924.. 3455 

CD) OTHER INFORMATION : /noce= "The sequence of the ALPHA 
6A cDNA amplified using primers 1156/1157." 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

GCGCGACCGT CCCGGGGGTG GGGCCGGGCG CAGCGGCGAG AGGAGGCGAA GGTGGCTGCG 60 

GTAGCAGCAG CGCGGCAGCC TCGGACCCAG CCCGGAGCGC AGGGCGGCCG CTGCAGGTCC 120 

CCGCTCCCCT CCCCGTGCGT CCGCCCATGG CCGCCGCCGG GCAGCTGTGC TTGCTCTACC 180 

TGTCGGCGGG GCTCCTGTCC CGGCTCGGCG CAGCCTTCAA CTTGGACACT CGGGAGGACA 240 

ACGTGATCCG GAAATATGGA GACCCCGGGA GCCTCTTCGG CTTCTCGCTG GCCATGCACT 300 

GGCAACTGCA GCCCGAGGAC AAGCGGCTGT TGCTCGTGGG GGCCCCGCGC GGAGAAGCGC 360 

TTCCACTGCA GAGAGCCTTC AGAACGGGAG GGCTGTACAG CTGCGACATC ACCGCCCGGG 420 

GGCCATGCAC GCGGATCGAG TTTGATAACG ATGCTGACCC CACGTCAGAA AGCAAGGAAG 480 

ATCAGTGGAT GGGGGTCACC GTCCAGAGCC AAGGTCCAGG- GGGCAAGGTC GTGACATGTG 540 

CTCACCGATA TGAAAAAAGG CAGCATGTTA ATACGAAGCA GGAATCCCGA GACATCTTTG 600 

GGCGGTGTTA TGTCCTGAGT CAGAATCTCA GGATTGAAGA CGATATGGAT GGGGGAGATT 660 

GGAGCTTTTG TGATGGGCGA TTGAGAGGCC ATGAGAAATT TGGCTCTTGC CAGCAAGGTG 720 

TAGCAGCTAC TTTTACTAAA GACTTTCATT ACATTGTATT TGGAGCCCCG GGTACTTATA 780 

ACTGGAAAGG GATTGTTCGT G TAG AG C AAA AGAATAACAC TTTTTTTGAC ATGAACATCT 840 

TTGAAGATGG GCCTTATGAA GTTGGTGGAG AGACTGAGCA TGATGAAAGT CTCGTTCCTG 900 

TTCCTGCTAA CAGTTACTTA GGTTTTTCTT TGGACTCAGG GAAAGGTATT GTTTCTAAAG 960 
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ATGAGATCAC TTTTGTATCT 


GGTGCTCCCA 


GAGCCAATCA 


CAGTGGAGCC 


GTGGTTTTGC 


1020 


TGAAGAGAGA CATGAAGTCT 


GCACATCTCC 


TCCCTGAGCA 


CATATTCGAT 


GGAGAAGGTC 


1080 


TGGCCTCTTC ATTTGGCTAT 


GATGTGGCGG 


TGATGGACCT 


CAACAAGGAT 


GGGTGGCAAG 


1140 


ATATACTTAT TGGAGCCCCA 


CAGTATTTTG 


ATAGAGATGG 


AGAAGTTGGA 


GGTGCAGTGT 


1200 


ATGTCTACAT GAACCAGCAA 


GGCAGATGGA 


ATAATGTGAA 


GCCAATTCGT 


CTTAATGGAA 


1260 


CCAAAGATTC TATGTTTGGC 


ATTGCAGTAA 


AAAATATTGG 


AGATATTAAT 


CAAGATGGCT 


1320 


ACCCAGATAT TGCAGTTGGA 


GCTCCGTATG 


ATGACTTGGG 


AAAGGTTTTT 


ATCTATCATG 


1380 


GATCTGCAAA TGGAATAAAT 


ACCAAACCAA 


CACAGGTTCT 


CAAGGGTATA 


TCACCTTATT 


1440 


TTGGATATTC AATTGCTGGA 


AACATGGACC 


TTGATCGAAA 


TTCCTACCCT 


GATGTTGCTG 


1500 


TTGGTTCCCT CTCAGATTCA 

X X w V» X X w w W X w * wx» a a v/f* 


GTAACTATTT 


TCAGATCCCG 


GCCTGTGATT 


AATATTCAGA 


1560 


AAAPCATPAC AGTAACTCCT 


AACAGAATTG 


ACCTCCGCCA 

** w w a w v w w wn 


GAAAACAGCG 


TGTGGGGCGC 


1620 


GTAGTGGGAT ATGCCTCCAG 

UlnUiwOuAl XiXwwwXwwX^w 


GTTAAATCCT 

\^ A A <AxMk A w A 


GTTTTGAATA 

V A A A A W«AfAA*A 


TACTGCTAAC 


CCCGCTGGTT 


1680 


ATAATCCTTC AATATCAATT 


GTGGGCACAC 


TTGAAGCTGA 


AAAAGAAAGA 


AG AAAATCTG 


1740 


GGCTATCCTC AAGAGTTCAG 


TTTCGAAACC 


AAGGTTCTGA 


GCCCAAATAT 


ACTCAAGAAC 


1800 


TAACTCTGAA GAG G C AG AAA 


CAGAAAGTGT 


GCATGGAGGA 


AACCCTGTGG 


CTACAGGATA 


1860 


ATATCAGAGA TAAACTGCGT 


CCCATTCCCA 


TAACTGCCTC 


AGTGGAGATC 


CAAGAGCCAA 


1920 


GCTCTCGTAG GCGAGTGAAT 


TCACTTCCAG 


AAGTTCTTCC 


AATTCTGAAT 


TCAGATGAAC 


1980 


CCAAGACAGC TCATATTGAT 


GTTCACTTCT 


TAAAAGAGGG 


ATGTGGAGAC 


GACAATGTAT 


2040 


GTAACAGCAA CCTTAAACTA 


GAATATAAAT 


TTTGCACCCG 


AGAAGGAAAT 


CAAGACAAAT 


2100 


TTTCTTATTT ACCAATTCAA 


AAAGGTGTAC 


CAGAACTAGT 


TCTAAAAGAT 


CAGAAGGATA 


2160 


TTGCTTTAGA AATAACAGTG 


ACAAACAGCC 


C 11 C CAACCC 


AAGGAATCCC 


ACAAAAGATG 




GCGATGACGC CCATGAGGCT 


AAACTGATTG 


CAACGTTTCC 


AGACACTTTA ACCTATTCTG 


2280 


CATATAGAGA ACTGAGGGCT 


TTCCCTGAGA 


AACAGTTGAG 


TTGTGTTGCC 


AACCAGAATG 


2340 


GCTCGCAAGC TGACTGTGAG 


CTCGGAAATC 


CTTTTAAAAG 


AAATTCAAAT 


GTCACTTTTT 


2400 


ATTTGGTTTT AAGTACAACT 


GAAGTCACCT 


TTGACACCCC 


ATATCTGGAT 


ATTAATCTGA 


2 A 60 


AGTTAGAAAC AACAAGCAAT 


CAAGATAATT 


TGGCTCCAAT 


TACAGCTAAA 


GCAAAAGTGG 


2520 
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TTATTGAACT GCTTTTATCG 


GTCTCGGGAG 


TTGCTAAACC 


TTCCCAGGTG 


TATTTTGGAG 


2580 


GTACAGTTGT TGGCGAGCAA 


GCTATGAAAT 


CTGAAGATGA AGTGGGAAGT 


TTAATAGAGT 


2640 


ATGAATTCAG GGTAATAAAC 


TTAGGTAAAC 


CTCTTACAAA 


CCTCGGCACA 


GCAACCTTGA 


2700 


ACATTCAGTG GCCAAAAGAA ATTAGCAATG 


GGAAATGGTT 


GCTTTATTTG 


GTGAAAGTAG 


2760 


AATCCAAAGG ATTGGAAAAG 


GTAACTTGTG 


AGCCACAAAA 


GGAGATAAAC 


TCCCTGAACC 


2820 


TAACGGAGTC TCACAACTCA 


AGAAAGAAAC 


GGGAAATTAC 


TGAAAAACAG 


ATAGATGATA 


2880 


AC AG AAAAT X TTCTTTATTT 


GCTGAAAGAA 


AATACCAGAC 


TCTTAACTGT 


AGCGTGAACG 


2940 


TGAACTGTGT GAACAiuAbA 


TGCCCGCTGC 


GGGGGCTGGA 


CAGCAAGGCG 


TCTCTTA^TT 


3000 


TGCGCTCGAG GTTATGGAAu 


AGCACATTTC TAGAGGAATA TTCCAAACTG 


AACTACTTGG 


3060 


ACATTCTCAT GCGAGCCXIC 


ATTGATGTGA 


CTGCTGCTGC 


CGAAAATATC 


AGGCTGCCAA 


3120 


ATGCAGGCAC TCAGGTlObA 


GTGACTGTGT 


TTCCCTCAAA 


GACTGTAGCT 


CAGTATTCGG 


3180 


GAGTACCTTG GTG GAT CATC 


CTAGTGGCTA TTCTCGCTGG 


GATCTTGATG 


CTTGCTTTAT 


3240 


TAGTGTTTAT ACTATGGAAG 


TGTGGTTTCT TCAAGAGAAA 


TAAGAAAGAT 


CATTATGATG 


3300 


CCACATATCA CAAGGCTGAG 


ATCCATGCTC 


AGCCATCTGA 


TAAAGAGAGG 


CTTACTTCTG 


3360 


ATGCATAGTA TTGATCTACT 


TCTGTAATTG 


TGTGGATTCT 


TTAAACGCTC 


TAG GT AC GAT 


3420 


GACAGTGTTC CCCGATACCA 


TGCTGTAAGG 


ATCCGGAAAG 


AAG AG C GAGA 


GATCAAAGAT 


3480 


GAAAAGTATA TTGATAACCT 


TGAAAAAAAA 


CAGTGGATCA 


CAAAGTGGAA 


CAGAAATGAA 


3540 


AGCTACTCAT AGCGGGGGCC 


TAAAAAAAAA AAAGCTTCAC 


AGTACCCAAA 


CTGCTTTTTC 


3600 


CAACTCAGAA ATTCAATTTG 


GATTTAAAAG 


CCTGCTCAAT 


CCCTGAGGAC 


TGATTTCAGA 


3660 


GTGACTACAC ACAGTAC GAA 


CCTACAGTTT 


TAACTGTGGA 


TATTGTTACG 


TAGCCTAAGG 


3720 


CTCCTGTTTT GCACAGCCAA ATTTAAAACT 


GTTGGAATGG 


ATTTTTCTTT 


AACTGCCGTA 


3780 


ATTTAACTTT CTGGGTTGCC 


TTTGTTTTTG 


GCGTGGCTGA 


CTTACATCAT 


GTGTTGGGGA 


3840 


AGGGCCTGCC CAGTTGCACT 


CAGGTGACAT 


CCTCCAGATA 


GTGTAGCTGA 


GGAGGCACCT 


3900 


ACACTCACCT GCACTAACAG 


AGTGGCCGTC 


CTAACCTCGG 


GCCTGCTGCG 


CAGACGTCCA 


39*0 


TCACGTTAGC TGTCCCACAT 


CACAAGACTA TGCCATTGGG 


GTAGTTGTGT 


TTCAACGGAA 


4020 


AGTGCTGTCT TAAACTAAAT 


GTGCAATAGA AGGTGATGTT 


GCCATCCTAC 


CGTCTTTTCC 


4080 
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TGTTTCCTAG 




APGTGCTCAC 


GTCAAATGCA 


TACAAGTTTC 


ATTCTCCCTT . 


4140 


TCACTAAAAA 


CACACAOb J.L» 


PA A P AG APTT 


GA ATGCTAGT 


TATACTTATT 


TGTATATGGT . 


4200 


ATTTATTTTT 


TCi 1 X X O X X X 


AP A A AGC ATT 


TTGTTATTGA 


CTAACAGGCC 


AAAGAGTCTC 


4260 


CAGTTTACCC 


rr»rp /"* A PPTTPP^ 


TTTAATCAAT 

XXX /in X wftA X 


CAGAATTAGA 

VAwAAl a*^wa* 


ATTAGAGCAT 


GGGAGGGTCA 


4320 


TCACTATGAC 


j-«rp AAA TT A TT 

CTAAAi XAi 1 


rn a PTHPA AAA 
X AO X \J Li AAAA 


AG A A A ATCTT 


TATAAATGTA 


C GAG AG AG AG 


4380 


TTGTTTTAAT 


A A PTT 1 A TPT A 

AACX 1A101A 


T A A A PTAT A A 
X AAA L» X A X AA 


VswXwXwwXXw 


ATGACAGCCT 


CCACCCCACA 


4440 


ACCCAAAAGG 


rprp*p » A f* A A A T 

TTTAAb AAA X 


A P A A TTA TA A 
AvjAAX X A1AA 


PTGTAAAGAT 


GTTTATTTCA 

W X X- A AA AAA WA» 


GGCATTGGAT 


4500 


ATTTTTTACT 


TTAGAAGCCT 


r* A T A A TfiTT 

Vj u A 1 AA X u X X 


lviuunii X XA 


GATAGTGTAA 

V/ AA X AA W A W A AAA* 


CATTCAGGAji 


4560 


TTCTTGGAGA 


AGATGGGTTT 


ATTCACTGAA 


CX C IAG IbUO 


U X X X Aw X OAL» 


TG CTG C AAA T 


4620 


AC TG TAT ATT 


CAGGACTTGA 


A A ^ AAA fTl r~> f> T> 

AAG AAATG GT 


(lAAlbLw X AX 


L> bAAO X AO X O 


GATCCAAACT 


4680 


GATCCAGTAT 


AAGACTACTG 


AATCTGCTAC 


^ A A A A /"* A PfT 

CAAAACAG I 1 


A ATP A P Tfi A CI 
AAX L/Avj X whva 


TPGAGTGTTC 

X woAw X Vj X X O 


4740 


TATTTTTTGT 


TTTGTTTCCT 


CCCCTATCTG 


m A mm /"» /"» A A A 

TATTCCCAAA 


A A TTi PTTTP 

AAX X Av X X Xo 


GGGPTA ATTT 


4800 


AACAAGAACT 


TTAAATTGTG 


TTTTAATTGT 


A A A A A TPPP A 


PPPPGTGGA A 


TT ATT A C T CT 

X AAA A 1AU X W* 


4860 


ATACATTCAA 


CAGAGACTGA 


ATAGATATGA 


A A /"* PTP A TTT 
AAvjU X uAX X X 


TTTTTAATTA 

X X X X X <if* A A ATk 


CCATGCTTCA 

\J W4i A W W A A ^*A* 


4920 


CAATGTTAAG 


TTATATG GGG 


AGCAACAGCA 


A A P APPTGPf 


AATTTGTTTT 

/aaa XXAWAAAA 


GG AT ATAGTA 


4980 


TAAGCAGTGT 


CTGTGTTTTG 


AAAGAATAGA 


AP APAGTTTG 
nOnUnu X x X w 


TAGTGCCACT 


GTTGTTTTGG 


5040 


GGGGGGCTTT 


TTTTCTTTTT 


CCGGAAAATC 


GTTAAACCTT 


AAGATACTAA 


GGACGTTGTT 


5100 


TTGGTTGTAC 


TTGGAATTCT 


TAGTCACAAA 


ATATATTTTG 

A *» X *A A X A A. w 


TTTACAAAAA 


TTTCTGTAAA 


5160 


ACAGGTTATA ACAGTGTTTA AAGTCTCAGT 


TTCTTG CTTG 

X X V A A> W W A A W 


GGGAACTTGT 


GTCCCTAATG 


5220 


TGTTAGATTG 


CTAGATTGCT 


AAGGAGCTGA 


TACTTGACAG 


TTTTTTAGAC 


CTGTGTTACT 


5280 


AAAAAAAAGA 


TGAATGTCGG 


AAAAGGGTGT 


TGGGAGGGTG 


GTCAACAAAG 


AAAPAAAGAT 


5340 


GTTATGGTGT TTAGACTTAT 


GGTTGTTAAA 


AATGTCATCT 


CAAGTCAAGT 


CACTGGTCTG 


5400 


TTTGCATTTG ATACATTTTT GTACTAACTA 


GCATTGTAAA 


ATTATTTCAT 


GATTAGAAAT 


5460 


TACCTGTGGA 


TATTTGTATA AAAGTGTGAA 


ATAAATTTTT 


TATAAAAGTG 


TTCATTGTTT 


5520 


CGTAACACAG 


CATTGTATAT 


GTGAAGCAAA 


CTCTAAAATT ATAAATGACA 


ACCTGAATTA 


5580 


TCTATTTCAT 


CAAAAAAAAA 


AAAAAAAAAA 


ACTTTATGGG 


CACAACTGG 




5629 



WO 92/19647 



PCT/US92/03S27 



78 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1091 amino acids 
* (B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

Cii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1..1091 

(D) OTHER INFORMATION : /note= M SEQ ID NO: 3 is the 1091 

residue amino acid sequence of the human ALPHA 6B 
protein." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1. .1044 „ ' 
(D) OTHER INFORMATION: /note= "The sequence of SEQ ID NO: 3 

is identical to SEQ ID NO:l between amino acids 1 
and 1044." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 927.. 1060 

CD) OTHER INFORMATION: /note* "Encompasses the sequence 

encoded by the fragment of ALPHA 6B cDNA amplified 
using primers 115 6/115 7." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Ala Ala Gly Gin Leu Cys Leu Leu Tyr Leu Ser Ala Gly Leu 
x 5 10 15 

Leu Ser Arg Leu Gly Ala Ala Phe Asn Leu Asp Thr Arg Glu Asp Asn 
20 25 30 

Val lie Arg Lys Tyr Gly Asp Pro Gly Ser Leu Phe Gly Phe Ser Leu 
35 40 45 

Ala Met H*s Trp Gin Leu Gin Pro Glu Asp Lys Arg Leu Leu Leu Val 
50 55 60 
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Gly Ala Pro Arg Gly Glu Ala Leu Pro Leu Gin Arg Ala Phe Arg Thr 
65 70 75 80 

Gly Gly Leu Tyr Ser Cys Asp He Thr Ala Arg Gly Pro Cys Thr Arg 
85 90 95 

He Glu Phe Asp Asn Asp Ala Asp Pro Thr Ser Glu Ser Lys Glu Asp 
100 105 110 

Gin Trp Met Gly Val Thr Val Gin Ser Gin Gly Pro Gly Gly Lys Val 
115 " 120 125 

Val Thr Cys Ala His Arg Tyr Glu Lys Arg Gin His Val Asn Thr Lys 
130 135 140 

Gin Glu Ser Arg Asp He Phe Gly Arg Cys Tyr Val Leu Ser Gin Asn 
145 150 155 160 

Leu Arg He Glu Asp Asp Met Asp Gly Gly Asp Trp Ser Phe Cys Asp 
165 170 175 

Gly Arg Leu Arg Gly His Glu Lys Phe Gly Ser Cys Gin Gin Gly Val 
180 " 185 190 

Ala Ala Thr Phe Thr Lys Asp Phe His Tyr He Val Phe Gly Ala Pro 
195 200 205 

Gly Thr Tyr Asn Trp Lys Gly He Val Arg Val Glu Gin Lys Asn Asn 
210 215 220 

Thr Phe Phe Asp Met Asn He Phe Glu Asp Gly Pro Tyr Glu Val Gly 
225 230 235 240 

Gly Glu Thr Glu His Asp Glu Ser Leu Val Pro Val Pro Ala Asn Ser 
245 250 255 

Tyr Leu Gly Phe Ser Leu Asp Ser Gly Lys Gly He Val Ser Lys Asp 
260 265 270 

Glu He Thr Phe Val Ser Gly Ala Pro Arg Ala Asn His Ser Gly Ala 
275 280 . 285 

Val Val Leu Leu Lys Arg Asp Met Lys Ser Ala His Leu Leu Pro Glu 
290 295 300 

His He Phe Asp Gly Glu Gly Leu Ala Ser Ser Phe Gly Tyr Asp Val 
305 310 315 320 

Ala Val Met Asp Leu Asn Lys Asp Gly Trp Gin Asp He Val He Gly 
325 330 335 
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Ala Pro Gin Tyr Phe Asp Arg Asp Gly Clu Val Gly Gly Ala Val Tyr 
340 345 350 



Val Tyr Met Asn Gin Gin Gly Arg Trp Asn Asn Val Lys Pro lie Arg 
355 360 365 



Leu Asn Gly Thr Lys Asp Ser Met Phe Gly lie Ala Val Lys Asn He 
370 375 380 

Gly Asp He Asn Gin Asp Gly Tyr Pro Asp He Ala Val Gly Ala Pro 
385 390 395 400 

Tvr Asp Asp Leu Gly Lys Val Phe He Tyr His Gly Ser Ala Asn Gly 
' 405 410 415 

He Asn Thr Lys Pro Thr Gin Val Leu Lys Gly He Ser Pro Tyr Phe 
420 425 430 

Gly Tyr Ser He Ala Gly Asn Met Asp Leu Asp Arg Asn Ser Tyr Pro 
435 440 445 

Asp Val Ala Val Gly Ser Leu Ser Asp Ser Val Thr He Phe Arg Ser 
450 455 460 

Are Pro Val He Asn He Gin Lys Thr lie Thr Val Thr Pro Asn Arg 
4 6 f 470 • 475 480 

He Asp Leu Arg Gin Lys Thr Ala Cys Gly Ala Pro Ser Gly lie Cys 
485 490 495 

Leu Gin Val Lys Ser Cys Phe Glu Tyr Thr Ala Asn Pro Ala Gly Tyr 
500 505 510 

Asn Pro Ser He Ser He Val Gly Thr Leu Glu Ala Glu Lys Glu Arg 
515 520 525 

Are Lys Ser Gly Leu Ser Ser Arg Val Gin Phe Arg Asn Gin Gly Ser 
530 " 535 540 



Glu Pro Lys Tyr Thr Gin Glu Leu Thr Leu Lys Arg Gin Lys Gin Lys 
545 * 550 555 560 

Val Cys Met Glu Glu Thr Leu Trp Leu Gin Asp Asn He Arg Asp Lys 



565 



570 575 



Leu Arg Pro lie Pro He Thr Ala Ser Val Glu He Gin Glu Pro Ser 
580 585 590 

Ser Arg Arg Arg Val Asn Ser Leu Pro Glu Val Leu Pro He Leu Asn 
595 600 6 °5 



WO 92/19647 



PCT/US92/03527 



81 



Ser Asp Glu Pro Lys Thr Ala His lie Asp Val His Phe Leu Lys Glu 
610 615 620 

Gly Cys Gly Asp Asp Asn Val Cys Asn Ser Asn Leu Lys Leu Glu Tyr 
625 630 ' 635 6 AO 

Lys Phe Cys Thr Arg Glu Gly Asn Gin Asp Lys Phe Ser Tyr Leu Pro 
645 650 " " 655 

lie Gin Lys Gly Val Pro Glu Leu Val Leu Lys Asp Gin Lys Asp lie 
660 665 670 

Ala Leu Glu lie Thr Val Thr Asri Ser Pro Ser Asn Pro Arg Asn Pro 
675 680 685 

Thr Lys Asp Gly Asp Asp Ala His Glu Ala Lys Leu lie Ala Thr Phe 
690 695 700 

Pro Asp Thr Leu Thr Tyr Ser Ala Tyr Arg Glu Leu Arg Ala Phe Pro 
705 710 715 720 

Glu Lys Gin Leu Ser Cys Val Ala Asn Gin Asn Gly Ser Gin Ala Asp 
725 730 735 

Cys Glu Leu Gly Asn Pro Phe Lys Arg Asn Ser Asn Val Thr Phe Tyr 
740 745 750 

Leu Val Leu Ser Thr Thr Glu Val Thr Phe Asp Thr Pro Tyr Leu Asp 
755 760 765 



lie Asn Leu Lys Leu Glu Thr Thr Ser Asn Gin Asp Asn Leu Ala Pro 
770 ' 775 780 

lie Thr Ala Lys Ala Lys Val Val lie Glu Leu Leu Leu Ser Val Ser 
785 790 795 800 

Gly Val Ala Lys Pro Ser Gin Val Tyr Phe Gly Gly Thr Val Val Gly 
805 810 815 

Glu Gin Ala Met Lys Ser Glu Asp Glu Val Gly Ser Leu lie Glu Tyr 
820 825 830 

Glu Phe Arg Val lie Asn Leu Gly Lys Pro Leu Thr Asn Leu Gly Thr 
835 840 845 

Ala Thr Leu Asn lie Gin Trp Pro Lys Glu lie Ser Asn Gly Lys Trp 
850 855 860 

Leu Leu Tyr Leu Val Lys Val Glu Ser Lys Gly Leu Glu Lys Val Thr 
865 870 875 880 
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Cys Glu Pro Gin Lys Glu He Asn Ser Leu Asn Leu Thr Glu Ser His 

Glu Lys Gin He Asp 
900 905 910 



885 

Asn Ser Arg Lys Lys. Arg Glu He Thr Glu Lys Gin lie Asp Asp Asn 



Are Lys Phe Ser Leu Phe Ala Glu Arg Lys Tyr Gin Thr Leu Asn Cys 
915 920 925 

Ser Val Asn Val Asn Cys Val Asn He Arg Cys Pro Leu Arg Gly Leu 
930 935 940 

Asp Ser Lys Ala Ser Leu He Leu Arg Ser Arg Leu Trp Asn Ser Thr 

Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp He Leu Met Arg 
965 970 975 

Ala Phe He Asp Val Thr Ala Ala Ala Glu Asn He Arg Leu Pro Asn 
980 985 990 

Ala Gly Thr Gin Val Arg Val Thr Val Phe Pro Ser Lys Thr Val Ala 
995 1000 1005 

Gin Tyr Ser Gly Val Pro Trp Trp lie He Leu Val Ala He Leu Ala 
1010 1015 1020 

Gly lie Leu Met Leu Ala Leu Leu Val Phe He Leu Trp Lys Cys Gly 
1025 1030 1035 10™ 

Phe Phe Lys Arg Ser Arg Tyr Asp Asp Ser Val Pro Arg Tyr His Ala 
1045 1° 50 1055 

Val Arg He Arg Lys Glu Glu Arg Glu He Lys Asp Glu Lys Tyr He 
1060 1065 1070 

Asp Asn Leu Glu Lys Lys Gin Trp He Thr Lys Trp Asn Arg Asn Glu 
1075 1080 1085 

Ser Tyr Ser 
1090 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: - - 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 1 . '. 5499 

(D) OTHER INFORMATION : /product- "Human ALPHA 6B" 

<ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..3260 

(D) OTHER INFORMATION: /note= "The sequence of SEQ ID NO: A 
is identical to SEQ ID NO: 2 between nucleotides 1 
and 3260." 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 326l7.5499 

(D) OTHER INFORMATION: /note* "Nucleotides 3261-5499 of 

SEQ ID NO:4 are identical to nucleotides 3391-5629 
of SEQ ID NO: 2. SEQ ID NO: 4 has a 130 nucleotide 
deletion in relation to SEQ ID N0:2." 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 2924.-3325 

(D) OTHER INFORMATION: /note* "Encompasses the sequence of 
the ALPHA 6B cDNA amplified using primers 
1156/1157." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GCGCGACCGT 


CCCGGGGGTG 


GGGCCGGGCG 


CAGCGGCGAG 


AGGAGGCGAA 


GGTGGCTGCG 


60 


GTAGCAGCAG 


CGCGGCAGCC 


TCGGACCCAG 


CCCGGAGCGC 


AGGGCGGCCG 


CTGCAGGTCC 


120 


CCGCTCCCCT 


CCCCGTGCGT 


CCGCCCATGG 


CCGCCGCCGG 


GCAGCTGTGC 


TTGCTCTACC 


180 


TGTCGGCGGG 


GCTCCTGTCC 


CGGCTCGGCG 


CAGCCTTCAA 


CTTGGACACT 


CGGGAGGACA 


240 


ACGTGATCCG 


GAAATATGGA 


GACCCCGGGA 


GCCTCTTCGG 


CTTCTCGCTG 


GCCATGCACT 


300 


GGCAACTGCA 


GCCCGAGGAC 


AAGCGGCTGT 


TGCTCGTGGG. 


GGCCCCGCGC 


GGAGAAGCGC 


.360 


TTCCACTGCA 


GAGAGCCTTC 


AGAACGGGAG 


GGCTGTACAG 


CTGCGACATC 


ACCGCCCGGG 


420 


GGCCATGCAC 


GCGGATCGAG 


TTTGATAACG 


ATGCTGACCC 


CACGTCAGAA 


AGCAAGGAAG 


480 
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ATCAGTGGAT GGGGGTCACC 


GTCCAGAGCC AAGGTCCAGG GGGCAAGGTC 


GTGACATGTG 


540 


CTCACCGATA TGAAAAAAGG 


CAGCATGTTA ATACGAAGCA GGAATCCCGA 


GACATCTTTG 


600 


GGCGGTGTTA TGTCCTGAGT 


CAGAATCTCA GGATTGAAGA CGATATCGAT 


GGGG GAG ATT 


6 60 


GGAGCTTTTG TGATGGGCGA 


TTGAGAGGCC ATGACAAATT TGGCTCTTGC 


CAGCAAGGTG 


720 


TAGCAGCTAC TTTTACTAAA 


GACTTTCATT ACATTGTATT TGGAGCCCCG 


GGTACTTATA 


780 


ACTGGAAAGG GATTGTTCGT 


GTAGAGCAAA AGAATAACAC TTTTTTTGAC 


ATGAACATCT 


840 


TTGAAGATGG GCCTTATGAA 


GTTGGTGGAG AGACTGAGCA TGATGAAAGT 


CTCGTTCCTG 


900 


TTCCTGCTAA CAGTTACTTA 


GGTTTTTCTT TGGACTCAGG GAAAGGTATT 


GTTTCTAAAG 


960 


ATGAGATCAC TTTTGTATCT 


GGTGCTCCCA GAGCCAATCA CAGTGGAGCC 


GTGGTTTTGC 


1020 


TGAAGAGAGA CATGAAGTCT 


GCACATCTCC TCCCTGAGCA CATATTCGAT 


GGAGAAGGTC 


1080 


TGGCCTCTTC ATTTGGCTAT 


GATGTGGCGG TGATGGACCT CAACAAGGAT 


GGGTGGCAAG 


1140 


ATATAGTTAT TGGAGCCCCA 


CAGTATTTTG ATAGAGATGG AGAAGTTGGA 


GGTGCAGTGT 


1200 


ATGTCTACAT GAACCAGCAA 


GGCAGATGGA ATAATGTGAA GCCAATTCGT 


CTTAATGGAA 


1260 


CCAAAGATTC TATGTTTGGC 


ATTGCAGTAA AAAATATTGG AGATATTAAT 


CAAGATGGCT 


1320 


ACCCAGATAT TGCAGTTGGA 


GCTCCGTATG ATGACTTGGG AAAGGTTTTT 


ATCTATCATG 


1380 


GATCTGCAAA TGGAATAAAT 


ACCAAACCAA CACAGGTTCT CAAGGGTATA 


TCACCTTATT 


1440 


TTGGATATTC AATTGCTGGA AACATGGACC TTGATCGAAA TTCCTACCCT 


GATGTTGCTG 


1500 


TTGGTTCCCT CTCAGATTCA 


GTAACTATTT TCAGATCCCG GCCTGTGATT 


AATATTCAGA 


1560 


AAACCATCAC AGTAACTCCT 


AACAGAATTG ACCTCCGCCA G AAAAC AG CG 


TGTGGGGCCC 


1620 


CTAGTGGGAT ATGCCTCCAG GTTAAATCCT GTTTTGAATA TACTGCTAAC 


CCCGCTGGTT 


1680 


ATAATCCTTC AATATCAATT 


GTGGG CACAC TTGAAGCTGA AAAAGAAAGA 


AGAAAATCTG 


1740 


GGCTATCCTC AAGAGTTCAG 


TTTCGAAACC AAGGTTCTGA GCCCAAATAT 


ACTCAAGAAC 


1800 


TAACTCTGAA GAGGCAG AAA 


CAGAAAGTGT GCATGGAGGA AACCCTGTGG 


CTACAGGATA 


1860 


AT AT C AG AG A TAAACTGCGT 


CCCATTCCCA TAACTGCCTC AGTGGAGATC 


CAAGAGCCAA 


1920 


GCTCTCGTAG GCGAG TGAAT 


TCACTTCCAG AAGTTCTTCC AATTCTGAAT 


TCAGATGAAC 


1980 


CCAAGACAGC TCATATTGAT 


GTTCACTTCT TAAAAGAGGG ATGTGGAGAC 


GACAATGTAT 


2040 
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GTAACAGCAA 


CCTTAAACTA 


GAATATAAAT 


TTTGCACCCG 


AGAAGGAAAT 


CAAGACAAAT 


2100 


TTTCTTATTT 


ACCAATTCAA 


AAAGGTGTAC 


CAGAACTAGT 


T CTAAAAG AT 


CAGAAGGATA 


2160 


TTGCTTTAGA AATAACAGTG 


ACAAACAGCC 


CTTCCAACCC AAGGAATCCC 


ACAAAAGATG 


2220 


GCGATGACGC 


CCATGAGGCT 


AAACTGATTG 


CAACGTTTCC 


AGACACTTTA 


ACCTATTCTG 


2280 


CATATAGAGA 


ACTGAGGGCT 


TTCCCTGAGA 


AACAGTTGAG 


TTGTGTTGCC 


AACCAGAATG 


2340 


GCTCGCAAGC 


TGACTGTGAG 


CTCGGAAATC 


CTTTTAAAAG 


AAATTCAAAT 


GTCACTTTTT 


2400 


ATTTGGTTTT 


AAGTACAACT 


GAAGTCACCT 


TTGACACCCC 


ATATCTGGAT 


ATTAATCTGA 


2460 


AGTTAGAAAC 


AACAAGCAAT 


CAAGATAATT 


TGGCTCCAAT 


TACAGCTAAA 


GCAAAAGTGG 


2520 


TTATTGAACT 


GCTTTTATCG 


GTCTCGGGAG 


TTGCTAAACC 


TTCCCAGGTG 


TATTTTGGAG 


2580 


GTACAGTTGT 


TGGCGAGCAA 


GCTATGAAAT 


CTGAAGATGA 


AGTGGGAAGT 


TTAATAGAGT 


2640 


ATGAATTCAG 


GGTAATAAAC 


TTAGGTAAAC 


CTCTTACAAA 


CCTCGGCACA 


GCAACCTTGA 


2700 


ACATTCAGTG 


GCCAAAAGAA 


ATTAGCAATG 


GGAAATGGTT 


GC TTTATTTG 


GTGAAAGTAG 


2760 


AATCCAAAGG 


ATTGGAAAAG 


GTAACTTGTG 


AGCCACAAAA 


GGAGATAAAC 


TCCCTGAACC 


2820 


TAACGGAGTC 


TCACAACTCA 


AGAAAGAAAC 


GGGAAATTAC 


TGAAAAACAG 


ATAGATGATA 


2880 


ACAGAAAATT 


TTCTTTATTT 


GCTGAAAGAA 


AATACCAGAC 


TCTTAACTGT 


AGCGTGAACG 


2940 


TGAACTGTGT 


GAACATCAGA 


TGCCCGCTGC 


GGGGGCTGGA 


CAGCAAGGCG TCTCTTATTT 


3000 


TGCGCTCGAG 


GTTATGGAAC 


AGCACATTTC 


TAGAGGAATA 


TTCCAAACTG 


AACTACTTGG 


3060 


ACATTCTCAT 


GCGAGCCTTC 


ATTGATGTGA 


CTGCTGCTGC 


CGAAAATATC 


AGGCTGCCAA 


3120 


ATGCAGGCAC 


TCAGGTTCGA 


GTGACTGTGT 


TTCCCTCAAA 


GACTGTAGCT 


CAGTATTCGG 


3180 


GAGTACCTTG 


GTGGATCATC 


CTAGTGGCTA 


TTCTCGCTGG 


GATCTTGATG 


CTTGCTTTAT 


3240 


TAGTGTTTAT 


ACTATGGAAG 


TGTGGATTCT 


TTAAACGCTC 


TAGGTACGAT 


GACAGTGTTC . 


3300 


CCCGATACCA 


TGCTGTAAGG 


ATCCGGAAAG 


AAGAGCGAGA 


GATCAAAGAT 


G AAAAG TATA 


3360 


TTGATAACCT 


TGAAAAAAAA 


CAGTGGATCA 


CAAAGTGGAA 


CAGAAATGAA 


AGCTACTCAT 


3420 


AGCGGGGGCC 


TAAAAAAAAA 


AAAGCTTCAC 


AGTACCCAAA 


CTGCTTTTTC 


CAACTCAGAA 


3480 


ATTCAATTTG 


GATTTAAAAG 


CCTGCTCAAT 


CCCTGAGGAC 


TGATTTCAGA 


GTGACTACAC 


3540 


ACAGTACGAA 


CCTACAGTTT 


TAACTGTGGA 


TATTGTTACG 


TAGCCTAAGG 


CTCCTGTTTT 


3600 
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GCACAGCCAA ATTTAAAACT GTTGGAATGG ATTTTTCTTT 
" CTGGGTTGCC TTTGTTTTTG GCGTGGCTGA CTTACATCAT 
CAGTTGCACT CAGGTGACAT CCTCCAGATA GTGTAGCTGA 
GCACTAACAG AGTGGCCGTC CTAACCTCGG GCCTGCTGCG 
TGTCCCACAT CACAAGACTA TGCCATTGGG GTAGTTGTGT 
TAAACTAAAT GTGCAATAGA AGGTGATGTT GCCATCCTAC 
CTGTGTGAAT ACCTGCTCAC GTCAAATGCA TACAAGTTTC 
CACACAGGTG CAACAGACTT GAATGCTAGT TATACTTATT 
TCTTTTCTTT AC AAACCATT TTGTTATTGA CTAACAGGCC 
TTCAGGTTGG TTTAATCAAT CAGAATTAGA ATT AG AG CAT 
CTAAAT TATT TACTGCAAAA AGAAAATCTT TATAAATGTA 
AACTTATCTA TAAACTATAA CCTCTCCTTC ATGACAGCCT 
TTTAAGAAAT AGAATTATAA CTGTAAAGAT GTTTATTTCA 
TTAGAAGCCT GCATAATGTT TCTGGATTTA CATACTGTAA 
AGATGGGTTT ATTCACTGAA CTCTAGTGCG GTTTACTCAC 
CAGGACTTGA AAGAAATGGT GAATGCCTAT GGAACTAGTG 
AAGACTACTG AATCTGCTAC CAAAACAGTT AATCAGTGAG 
TTTGTTTCCT CCCCTATCTG TATTCCCAAA AATTACTTTG 
TTAAATTGTG TTTTAATTGT AAAAATGGCA GGGGGTGGAA 
CAGAGACTGA ATAGATATGA AAGCTGATTT TTTTTAATTA 
TTATATG GGG AGCAACAGCA AACAGGTGCT AATTTGTTTT 
CTGTGTTTTG AAAGAATAGA ACACAGTTTG TAGTGCCACT 
TTTTCTTTTT CCGGAAAATC CTTAAACCTT AAGATACTAA 
TTGGAATTCT TAGTCACAAA ATATATTTTG TTTACAAAAA 
ACAGTGTTTA AAGTCTCAGT TTCTTGCTTG GGGAACTTGT 
CTAGATTGCT AAGGAGCTGA TACTTGACAG TTTTTTAGAC 



AACTGCCGTA ATTTAACTTT 3660 

GTGTTGGGCA AGGGCCTGCC 3720 

GGAGGCACCT ACACTCACCT 3780 

CAGACGTCCA TCACGTTAGC 3840 

TTCAACGGAA AGTGCTGTCT 3900 

CGTCTTTTCC TGTTTCCTAG 39 60 

ATTCTCCCTT TCACTAAAAA A020 

TGTATATGGT ATTTATTTTT 4080 

AAAGAGTCTC CAGTTTACCC 4140 

GGGAGGGTCA TCACTATGAC 4200 

CC AG AG AG AG TTGTTTTAAT 4260 

CCACCCCACA ACCCAAAAGG 4320 

GGCATTGGAT ATTTTTTACT 4380 

CATTCAGGAA TTCTTGGAGA 4440 

TGCTGCAAAT ACTGTATATT 4500 

GATCCAAACT GATCCAGTAT 4560 

TCGAGTGTTC TATTTTTTGT 4620 

GGG CTAATTT AACAAGAACT 4680 

TTATTACTCT ATACATTCAA 4740 

CCATGCTTCA CaATGTTAAG 4800 

GG AT ATAGTA TAAGCAGTGT 4860 

GTTGTTTTGG GGGGGGCTTT 4920 

GGACGTTGTT TTGGTTGTAC 4980 

TTTCTGTAAA ACAGGTTATA 5040 

GTCCCTAATG TGTTAGATTG 5100 

CTGTGTTACT AAAAAAAAGA 5160 
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TGAATGTCGG AAAAGGGTGT TGGGAGGGTG GTCAACAAAG AAACAAAGAT GTTATGGTGT 5220 
TTAGACTTAT GGTTGTTAAA AATGTCATCT CAAGTCAAGT CACTGGTCTG TTTGCATTTG 5280 
ATACATTTTT GTACTAACTA GCATTGTAAA ATTATTTCAT GATTAGAAAT TACCTGTGGA 5340 
TATTTGTATA AAAGTGTGAA ATAAATTTTT TATAAAAGTG TTCATTGTTT CGTAACACAC 5400 
CATTGTATAT GTGAAGCAAA CTCTAAAATT ATAAATGACA ACCTGAATTA TCTATTTCAT 5460 
CAAAAAAAAA AAAAAAAAAA ACTTTATGGG CACAACTGG 5499 
(2) INFORMATION FOR SEQ ID NO: 5: 

I 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1..141 

(D) OTHER INFORMATION: /note* "The 141 amino acid sequence 
predicted from the nucleic acid product which 
results from amplification of rhe mouse ALPHA 6B 
cDNA with primers 1157/1156." 

(ix) FEATURE: 

(A) NAME/KEY: Domain 

(B) LOCATION: 88., 113 

(D) OTHER INFORMATION: /note- "The putative transmembrane 
domain." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1. .120 

(D) OTHER INFORMATION: /note- "SEQ ID NO : 5 is identical to 
SEQ ID NO: 7 at amino acid position 1. through 120; 
the two sequences diverge at amino acid 121." 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Thr Leu Asn Cys Ser Val Asn VaL Arg Cys Val Asn He Arg Cys Pro 
1 5 10 : 15 

Leu Arg Gly Leu Asp Ser Lys Ala Ser Leu Val Leu Arg Ser Arg Leu 
20 25 30 

Trp Asn Ser Thr Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp 
35 AO 45 

He Leu Leu Arg Ala Ser He Asp Val Thr Ala Ala Ala Gin Asn He 
50 55 60 

Lys Leu Leu Thr Ala Gly Thr Gin Val Arg Val Thr Val Phe Pro Ser 
65 70 75 80 

Lys Thr Val Ala Gin Tyr Ser Gly Val Ala Trp Trp He He Leu Leu 
85 90 95 

Ala Val Leu Ala Gly He Leu Met Leu Ala Leu Leu Val Phe Leu Leu 
100 " 105 HO 

Trp Lys Cys Gly Phe Phe Lys Arg Ser Arg Tyr Asp Asp Ser lie Pro 
115 120 125 

Are Tyr His Ala Val Arg He Arg Lys Glu Glu Arg Glu 
130 135 14° 

(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .426 

(D) OTHER INFORMATION: /product- "Mouse ALPHA 6B amino 
acid sequence in SEQ ID NO: 5." 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 



WO 92/19647 



PCT/US92/03527 



89 

(B) LOCATION: 262. .337 

(D) OTHER INFORMATION: /function* "Putative transmembrane 
region. " 

(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: (342^343) 

(D) OTHER INFORMATION: /note= "SEQ ID NO: 6 is identical to 
SEQ ID NO: 8 except for 130 nucleotides present in 
SEQ ID NO: 8 but deleted between nucleotides 342 
and 343 of SEQ ID NO: 6." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GACTCTTAAC TGTAGCGTGA ACGTGAGGTG TGTGAACATC AGGTGCCCAC TGCGAGGGCT 60 

GGACAGCAAG GCCTCTCTCG TTCTTCGTTC CAGGTTGTGG AACAGCACAT TTCTAGAGGA 120 

AT ATT CC AAA CTGAACTACT TGGACATTCT CCTGAGGGCT TCCATAGATG TCACCGCTGC 180 

TGCTCAGAAT ATCAAGCTCC TCACCGCCGG CACTCAGGTT CGAGTGACGG TGTTTCCCTC 240 

AAAGACTGTA GCTCAGTATT CAGGAGTAGC TTGGTGGATC ATCCTCCTGG CTGTTCTTGC 300 

CGGGATTCTG ATGCTGGCTC TATTAGTGTT TTTACTGTGG AAGTGTGGAT TCTTTAAGCG 360 

CTCTAGGTAC GATGACAGCA TTCCCCGATA CCATGCGGTG CGGATCCGGA AAGAAGAGCG 420 
AGAGAT 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1. .149 

(D) OTHER INFORMATION: /note- "The 149 amino acid sequence 
predicted from the product which results from 
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amplification of the mouse ALPHA 6A cDNA with 
primers 1157/1156." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1..120 

(D) OTHER INFORMATION: /note«= "SEQ ID NO: 7 is identical to 
SEQ ID NO: 5 at amino acid positions 1 through 120; 
the sequences diverge at amino acid 121." 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Thr Leu Asn Cys Ser Val Asn Val Arg Cys Val Asn lie Arg Cys Pro 
1 5 10 15 

Leu Arg Gly Leu Asp Ser Lys Ala Ser Leu Val Leu Arg Ser Arg Leu 
20 25 3° 

Trp Asn Ser Thr Phe Leu Glu Glu Tyr Ser Lys Leu Asn Tyr Leu Asp 
35 40 « 

lie Leu Leu Arg Ala Ser lie Asp Val Thr Ala Ala Ala Gin Asn lie 
50 55 60 

Lys Leu Leu Thr Ala Gly Thr Gin Val Arg Val Thr Val Phe Pro Ser 
65 70 75 80 

Lys Thr Val Ala Gin Tyr Ser Gly Val Ala Trp Trp lie He Leu Leu 
85 90 95 

Ala Val Leu Ala Gly He Leu Met Leu Ala Leu Leu Val Phe Leu Leu 
100 * 105 HO 

Trp Lys Cys Gly Phe Phe Lys Arg Asn Lys Lys Asp His Tyr Asp Ala 
115 " 120 125 

Thr Tyr His Lys Ala Glu He His Thr Gin Pro Ser Asp Lys Glu Arg 
L30 ; 135 140. 

Leu Thr Ser Asp Ala 
145 

<2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .556 

(D) OTHER INFORMATION: /product= "Mouse ALPHA 6A amino 
acid sequence in SEQ ID NO: 7." 

/note= "SEQ ID NO: 8 is the 556 base nucleotide 
sequence corresponding to the mouse ALPHA 6A amino 
acid sequence SEQ ID NO: 7, plus the first 109 
nucleotides in the 3' noncoding region." 

(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 342.. 472 

(D) OTHER INFORMATION: /note- "SEQ ID NO: 8 is identical to 
SEQ ID NO: 6 except it has a 130 base insertion 
(nucleotides 342-472 of SEQ ID NO: 8) between 
nucleotides 352 and 353 of SEQ ID NO: 6." 



(xi) SEQUENCE DESCRIPTION : . SEQ ID NO: 8: 



GACTCTTAAC 


TGTAGCGTGA 


ACGTGAGGTG 


TGTGAACATC 


AGGTGCCCAC 


TGCGAGGGCT 


60 


GGACAGCAAG 


GCCTCTCTCG 


TTCTTCGTTC 


CAGGTTGTGG 


AACAGCACAT 


TTCTAGAGGA 


120 


ATATTCCAAA 


CTGAACTACT 


TGGACATTCT 


CCTGAGGGCT 


TCCATAGATG 


TCACCGCTGC 


180 


TGCTCAGAAT 


ATCAAGCTCC 


TCACCGCCGG 


CACTCAGGTT 


CGAGTGACGG 


TGTTTCCCTC 


240 


AAAGACTGTA 


GCTCAGTATT 


CAGGAGTAGC 


TTGGTGGATC 


ATCCTCCTGG 


CTGTTCTTGC 


300 


CGGGATTCTG 


ATGCTGGCTC 


TATTAGTGTT 


TTTACTGTGG 


AAGTGTGGCT 


TCTTCAAGAG 


360 


AAATAAGAAA 


GATCATTACG 


ATGCCACCTA 


TCACAAGGCT 


GAGATCCATA 


CTCAGCCGTC 


420 


TGATAAAGAG 


AGGCTTACTT 


CCGATGCATA 


GTATTGATCT 


ACTTCCATAA 


TTGTGTGGAT 


480 


TCTTTAAGCG 


CTCTAGGTAC 


GATGACAGCA 


TTCCCCGATA 


CCATGCGGTG 


CGGATCCGGA 


540 


AAGAAGAGCG 


AGAGAT 










556 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein 
(Hi) HYPOTHETICAL: YES 
(Iv) ANTI- SENSE: NO 
(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 1..153 

(D) OTHER INFORMATION: /note- "SEQ ID NO: 9 is the 153 
amino acid sequence predicted from the product 
which results from amplification of the mouse 
ALPHA 3B cDNA with primers 2032/2033." 

(ix) FEATURE: 

(A) NAME/KEY: Domain 

(B) LOCATION: 108.. 112 

(D) OTHER INFORMATION: /note- "The cytoplasmic sequence 
CDFFK begins at amino acid position 108." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Ala Arg Cys Val Trp Leu Glu Cys Pro Leu Pro Asp Thr Ser Asn He 
x ' ' 5 10 15 

Thr Asn Val Thr Val Lys Ala Arg Val Trp Asn Ser Thr Phe He Glu 
20 25 30 

Asp Tyr Lys Asp Phe Asp Arg Val Arg Val Asp Gly Trp Ala Thr Leu 
35 AO 45 

Phe Leu Arg Thr Ser He Pro Thr He Asn Met Glu Asn Lys Thr Thr 
50 55 60 

Cys Phe Ser Val Asn He Asp Ser Lys Leu Leu Glu Glu Leu Pro Ala 
65 70 75 80 

Glu He Glu Leu Trp Leu Val Leu Val Ala Val Gly Ala Gly Leu Leu 
85 90 95 

Leu Leu Gly Leu He He He Leu Leu Trp Lys Cys Asp Phe Phe Lys 
100 105 HO 

Pro Thr Arg Tyr Tyr Arg He Met Pro Lys Tyr His Ala Val Arg He 

120 I 25 
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Are Glu Glu Asp Arg Tyr Pro Pro Pro Gly Ser Thr Leu Pro Thr Lys 
130 135 140 

Lys His Trp Val Thr Ser Trp Gin He 
145 * 150 

(2) INFORMATION FOR SEQ ID NO : 10 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



<ix) FEATURE: 

' (A) NAME/KEY: mis cofeature 
(B) LOCATION: 1..463 

(D) OTHER INFORMATION: /product* "Mouse ALPHA 3B amino 
acid sequence, in SEQ ID NO: 9." 

/note= "SEQ ID NO: 10 is the 463 base nucleotide 
sequence corresponding to the mouse ALPHA 3B amino 
acid sequence in SEQ ID NO: 9." 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 324. .338 

(D) OTHER INFORMATION: /producr« "The cytoplasmic sequence 
CDFFK." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 : 



GTGCCCGCTG 


TGTGTGGCTG 


GAGTGCCCCC 


TTCCAGACAC 


CTCCAACATT 


ACCAATGTGA 


60 


CCGTGAAAGC 


ACGGGTGTGG 


AACAGCACCT 


TCATTGAGGA 


CTACAAAGAC 


TTTGACAGAG. 


. 120 


TCAGGGTAGA 


TGGCTGGGCT 


ACCCTGTTCC 


TGAGAACCAG 


CATCCCTACC 


ATC^ACATGG 


180 


AGAACAAGAC 


CACATGTTTC 


TCTGTGAACA 


TTGACTCAAA 


GCTGTTGGAG 


GAGCTGCCCG 


240 


CTGAGATTGA 


GCTGTGGTTG 


GTGCTTGTGG 


CCGTGGGTGC 


TGGGTTGCTG 


CTGCTGGGGC 


300 


T CATC AT CAT 


CCTCTTGTGG 


AAGTGTGACT 


TCTTTAAGCC 


GACCCGCTAC 


TACCGGATTA 


360 


TGCCCAAGTA 


CCATGCAGTG 


CGTATCCGGG 


AGGAGGACCG 


CTACCCACCT 


CCAGGGAGCA 


420 
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CGCTACCCAC CAAGAAGCAC TGGGTCACCA GCTGGCAGAT TCG 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .20 
(D) OTHER INFORMATION: /standard_name= "PCR PRIMER 1157- 

/note« "Primer corresponds to bp 2918-2937 of the 

ALPHA 6A cDNA sequence of SEQ ID NO: 2." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GACTCTTAAC TGTAGCGTGA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 1..20 
(D) OTHER INFORMATION: /standard_name= "PCR PRIMER 1156" 

/note= "The primer corresponds to the complement 
of bp 3454-3473 of the APHA 6A cDNA sequence of 
SEQ ID NO:2." 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATCTCTCGCT CTTCTTTCCG 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .19 
(D) OTHER INFORMATION: /standard_name= "PCR PRIMER 1681" 

/note= "The primer corresponds to bp 2942-2960 of 
the ALPHA 6A cDNA sequence of SEQ ID NO: 2." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAACTGTGTG AACATCAGA 19 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .20 
(D) OTHER INFORMATION : /s tandard_name= "PCR PRIMER 2002" 
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/note= "The primer corresponds ro the complement 
of bp 3433-3452 of the ALPHA 6A cDNA sequence or 
SEQ ID NO: 2." 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

20 

ATCCTTACAG catggtatcg 

<2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base, pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 1.-20 

CD) OTHER INFORMATION: /standard_name« "PCR PRIMER 2032" 
/note= "The primer corresponds to the hamster^ 
ALPHA 3A cDNA sequence of Tsuji et. al. , J - Biol. 
Chem. , 265:7016-7021 (1990)." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AAGCCAAATC TGAGACTGTG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(Ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1 . . 20 

(D) OTHER INFORMATION:. /standard_name= "PCR PRIMER 2033" 
/note= ''The primer corresponds ro rhe hamster. 
ALPHA 3A cDNA sequence of Tsuji et al . , J. Biol. 
Chem. . 265:7016-7021 (1990)." 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GTAGTATCGG TCCCGAATCT 
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What Is Claimed Is: 

1. A polypeptide of about 24 to about 1091 
amino acid residues in length having a sequence that 
includes the a 6B cytoplasmic domain sequence shown in 
5 SEQ ID NO 3 from residue 1068 to residue 1091. 

2- The polypeptide of claim 1 having a sequence 
shown in SEQ ID NO 3 from residue 10 68 to residue 
1091. 

3. The polypeptide of claim 1 having a sequence 
10 shown in SEQ ID NO 3 from residue 1045 to residue 

1091. 

4. A polypeptide having an amino acid residue 
sequence shown in SEQ ID NO 3 from residue 1 to 
residue. 1091. 

15 5. A polypeptide of about 21 to about 141 amino 

acid residues in length having a sequence that 
includes the a 6B cytoplasmic domain sequence shown in 
SEQ ID, NO 5 from residue 121 to residue 141. . 

6. The polypeptide of claim 5 having a sequence 
20 shown in SEQ ID NO 5 from residue 121 to residue 141. 

7. A polypeptide of about 41 to about 153 amino 
acid residues in length having a sequence that 
includes the a 3B cytoplasmic domain sequence shown in 
SEQ ID NO 9 from residue 113 to residue 153. 

25 s. The polypeptide of claim 7 having a sequence 

shown in SEQ ID NO 9 from residue 113 to residue 153. 

9. An antibody molecule that immunoreacts with 
the a 6B protein and with a polypeptide having an amino 
acid residue sequence shown in SEQ ID NO 3 from 

30 residue 1068-1091. 

10. The antibody . molecule of claim 9 wherein 
said antibody molecule is a monoclonal antibody 
molecule. 

11. An antibody molecule that immunoreacts with 
35 the a 6B protein and with a polypeptide having an amino 
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acid residue sequence shown in SEQ ID NO 3 from 
residue 1045 to residue 1091. 

12. The antibody molecule of claim 11 wherein 
said antibody molecule is a monoclonal antibody 

5 molecule. 

13. An antibody molecule that immunoreacts with 
the a 6B protein and with a polypeptide having an 
amino acid residue sequence shown in SEQ ID NO 5 from 
residue 121 to residue 141. 

10 14. The antibody molecule of claim 13 wherein 

said antibody molecule is a monoclonal antibody 
molecule. 

15. An antibody molecule that immunoreacts with 
the a 3B protein and with a polypeptide having an amino 

15 acid residue sequence shown in SEQ ID NO 9 from 

residue 113 to residue 153. 

16. The antibody molecule of claim 15 wherein 
said antibody molecule is a monoclonal antibody 
molecule. 

20 17. A method for detecting the presence of 

antigen having the cytoplasmic domain of a 6B in a body 
sample comprising the steps of: 

a) admixing the body sample with a 
composition containing antibody molecules that 
25 immunoreact with the a 6B protein and with a 

polypeptide consisting essentially of an amino acid 
residue sequence shown in SEQ ID NO 3 from residue 
1045 to residue 1091 to form an immunoreaction 
admixture; 

30 b) maintaining said immunoreaction 

admixture under immunoreaction conditions for a time 
period sufficient for said antibody molecules to 
immunoreact with any a 6B present in said body sample 
and form an immunoreaction complex; and 

3 5 c) detecting the presence of any 

immunoreaction complex formed in step (b) and thereby 
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detecting the presence of said antigen in said body 
sample. 

18. The method of claim 17 wherein said 
detecting in step (c) comprises the steps of: 

5 (i) admixing said immunoreaction 

product formed in step (b) with an indicating means to 
form a second reaction admixture; 

(ii) maintaining said second reaction 
admixture for a time period sufficient for said 

10 indicating means to bind to the immunoreaction product 

formed in step (b) and form a second reaction product; 
and 

(iii) determining the presence of 
said indicating means in said second reaction product, 

15 and thereby the presence of said immunoreaction 

product formed in step (b) . 

19. The method of claim 18 wherein said 
indicating means is a labeled antibody comprising an 
antibody having a label affixed thereto. 

20 20. The method of claim 17 wherein said sample 

is a fluid sample, said admixing in step (a) includes 
admixing said body fluid sample and said antibody 
composition with a solid support comprising a solid 
matrix having affixed thereto a polypeptide having an 

25 amino acid residue sequence that includes an amino 

acid residue sequence shown in SEQ ID NO 3 from 
residue 1068 to residue 1091 such that said 
immunoreaction admixture is a competition 
immunoreaction admixture having a liquid phase and a 

30 solid phase, and said immunoreaction product formed in 

step (c) is in the solid phase. 

21. The method of claim 20 wherein said antibody 
is a labeled antibody, having a label affixed to the 
antibody. 

35 22. The method of claim 21 wherein said 

detecting in step (c) comprises determining the 
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presence of said label in the solid phase 
immunoreaction product, and thereby the presence of 
said immunoreaction product. 

23. The method of claim 120 wherein said 
polypeptide has an amino acid residue sequence shown 
in SEQ ID NO 3 from residue 1045 to residue 1091. 

24. The method of claim 17 wherein said sample 
is a fluid sample and said antibody molecules are 
affixed to a solid support such that said 
inununoreaction admixture is a competition 
inununoreaction admixture having a liquid phaLse and a 
solid phase, and said inununoreaction product formed in 
step (c) is in the solid phase. 

25. The method of claim 24 wherein said admixing 
in step (b) includes admixing said body fluid sample 
and said solid-phase antibody composition with a 
polypeptide having an amino acid residue sequence that 
includes an amino acid residue sequence shown in SEQ 
ID NO 3 from residue 1068 to residue 1091 such that 
said inununoreaction admixture is a competition 
inununoreaction admixture. 

26. The method of claim 25 wherein said 
polypeptide is a labeled polypeptide, having a label 
affixed to the polypeptide. 

27. A method for detecting the presence of 
antigen having the cytoplasmic domain of a 6B in a body 
sample comprising the steps of: 

a) admixing the body sample with a 
composition containing antibody molecules that 
immunoreact with the cr 6B protein and with a 
polypeptide having an amino acid residue sequence 
shown in SEQ ID NO 5 from residue 121 to residue 141 
to form an immunoreaction admixture; 

b) maintaining said immunoreaction 
admixture under immunoreaction conditions for a time 
period sufficient for said antibody molecules to 
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immunoreact with any ct 6B present in said body sample 
and form an immunoreaction complex; and 

c) detecting the presence of any 
immunoreaction complex formed in step (b) and thereby, 
5 detecting the presence -of said antigen in said body 

sample. 

28. A method for detecting the presence of 
antigen having the cytoplasmic domain of a 3B in a body 
sample comprising the steps of: 

10 a) admixing the body sample with a 

composition containing antibody molecules that 
immunoreact with the o 3B protein and a polypeptide 
having an amino acid residue sequence shown in SEQ ID 
NO 9 from residue 113 to residue 153 to form an 

15 immunoreaction admixture; 

b) maintaining said immunoreaction 
admixture under immunoreaction conditions for a time 
period sufficient for said antibody molecules to 
immunoreact with any a 3B present in said body sample 

20 and form an immunoreaction complex; and 

c) detecting the presence of any 
immunoreaction complex formed in step (b) and thereby 
detect the presence of said antigen in said body 
sample. 

25 29. A diagnostic system in kit form for assaying 

for the presence of a 6B subunit in a body sample, 
comprising a package containing, in an amount 
sufficient to perform at least one assay, an antibody 
composition comprising antibody molecules that 

30 immunoreact with the a 6B protein and with a 

polypeptide having an amino acid residue sequence 
shown in SEQ ID NO 3 from residue 1045 to residue 
1091. 

30. The diagnostic system of claim 29 wherein 
35 said antibody is affixed to a solid matrix. 
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31- The diagnostic system of claim 29 that 
further includes a solid support comprised of a solid 
matrix having affixed thereto a polypeptide having an 
amino acid residue sequence that includes an amino 
acid residue sequence shown in SEQ ID NO 3 from 
residue 1068 to residue 1091, 

32. The diagnostic system of claim 31 that 
further includes, in a separate package, labeled 
specific binding agent for signaling the presence of 
an immunoreaction product in the solid phase. 

33. A diagnostic system in kit form for assaying 
for the presence of a 6B subunit in a body sample, 
comprising a package containing, in an amount 
sufficient to perform at least one assay, an antibody 
composition comprising antibody molecules that 
immunoreact with the ce 6B protein and with a 
polypeptide having an amino acid residue sequence 
shown in SEQ ID NO 5 from residue 121 to residue 141. 

34. A diagnostic system in kit form for assaying 
for the presence of or 3B subunit in a body sample, 
comprising separate packages containing, in an amount 
sufficient to perform at least one assay, an antibody 
composition comprising antibody molecules that 
immunoreact with the cr 3B protein and with a 
polypeptide having an amino acid residue sequence 
shown in SEQ ID NO 9 from residue 113 to residue 153. 

35. The diagnostic system of claim 32 that 
further includes a solid support comprised of a solid 
matrix having said antibody molecules affixed thereto* 
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° 6B J, c^ggcc^a^gcggt^ "« 

30 „ CAAACTGAACTACTTGGACATTCTCAJG^ »» • 
S1 .4 AAATATCAGGCTGCCAAATGCAGGGACrCA^^ 
„„ TGTAGCTCACTATTCGGCAGTACCTTGGT^T^^ 

141 I^ga^U^a^^ 

CTTGATGCTTGCTTTATTAGTGTTTATACTATtKAAGTGTGGTTTCTTCAAGAGAAATAA J2M 

301 iliUUciUcTTTXrtAGTGirrATACTWGGAAG 

3 28 < CAAAGATCATTATGATGCCAGAIATCACAAG^GAGATCCATGC^^ »« 

••• * * <2 

361 

33„ AGAGAGGCTTACTTCTGATGCATAGTATTGATCTACTTCTGTAATTGTGTGGATTCTTTA 3.03 

TGTGGATTCT7TA «B0 
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oactcttftactgt agcgt gaacgtgaggtgtgtgaacatcaggtgcccact gcgagggct 2 97 6 
gact cttaactgt agcgt g&acgtgaggtgt gtgaacatcaggtgcccact gcgagggct 
T L N C S V N V R C V N IRC P L R G L 

ggacagcaaggcctctctcgttcttcgttccaggttgtggaacagcacat 1 1 ctagagga 303 6 
ggacagcaaggcctctctcgttctt cgtt ccaggttgtggaacagcacatttctagagga 
"OSKASL'V'LRSRLKMSTFLEE 

atattccaaactgaactacttggacattctcctgagggcttccatagatgtcaccgctgc 309 6 
atattccaaactgaactacttggacatt ct cctgagggctt ccatagatgt caccgctgc 
YSKLNYLDI LLR~ASIDVTAA 

tgctcagaatatcaagctcct caccgccggcactcaggttcgagtgacggtgttt ccctc 315 6 
tgctcagaatatcaagct cctcaccgccggcactcaggttcgagtgacggtgtttccct<: 
A Q N I K L L T A G T Q V R V T V F P S 

aaagactgtagctcagtattcaggagtagcttggtggatcatcctcctggctgttcttgc 321 6 
aaagactgtagctcagtattcaggagtagcttggtggatcatcctcctggctgttcttgc 
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